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© Enhanced yeast transcription e mpl oy in g hybrid promoter region 

© Yeast promoters of glycolytic enzymes ate modified by 
isolating a fragment encompassing the- RNA polymerase 
binding site and Joining to the 5' end of this fragment e DNA 
sequence providing for enhanced inducible or constitutive 
transcription of a structural gene. Constructs ere prepared 
for efficient expression of foreign genes in yeast 

Yeast strains 2150-2-3(pO/1GAPSOD) and AB110<pO/ 
1GAPATO), producing human arantitrypsin and superoxide 
dismutase, were deposited at the A.T.C.C on May 9, 1984 
and given Accession Nos. 20708 and 20709. respectively: and 
2150-2-3(GAP5), 2150-2-3|Pyk5) and 2150-2-3{PHO5GAPU 
expressing Hepatitis B surface antigen, were deposited at the 
A.T.C.C. on May 9. 1984 and given Accession Nos. 20705, 
20706 and 20707, respectively. 
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ENHANCED YEAST TRANSCRIPTION EMPLOYING HYBRID 

PROMOTER REGION CONSTRUCTS 

BACKGROUND OF THE INVENTION 
Field of the Invention 

With the advent of hybrid DNA technology, 
production of mammalian proteins in microorganisms be- 
came a reality. For the first time, mammalian proteins 
could be synthesized in a unicellular microorganism by 
introduction of a gene encoding for the mammalian pro- 
* tein under the transcriptional and translational control 
of regulatory sequences recognized by the microorganism 
host. The introduction of these foreign constructions 
into a microorganism host resulted in competition be- 
tween the regulatory signals of the construct and the 
regulatory signal endogenous to the host for the host 
systems involved with expression. The structural gene 
of interest is usually directed to a product which is 
nonproductive and may be injurious to the host. There- 
fore, host cells which can turn off the foreign gene 
can effectively dominate modified host cells. 

Substantial progress has been made in isolat- 
ing sequences concerned with transcriptional and trans- 
lational regulation for protein expression. But fre- 
quently flanking sequences, as well as distant sequences, 
may also affect the efficiency and regulation of the 
expression of the protein. Therefore, as one manipulates 
these various sequences, removing them from their native 
environment, and joining them to unnatural sequences, 

* 

that is sequences other than the wild- type sequence, 
one can never be certain as to the result. 

In order to enhance the economies of producing 
proteins in microorganisms, there have been substantial 
efforts directed to improving the efficiency of tran- 
scription and translation,* maximizing the proportion of 
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total protein directed to production of the desired 
product, enhancing the viability of the modified host, 
as well as improving the efficiency with which the modi- 
fied host may be obtained. 
Pescription of the Prior Art 

Guarente et al. , Proc. Natl . Acad. Sci. USA 
(1982) 79:7410-7414, describes a hybrid promoter region 
employing the GAL4 regulatory region. Guarente and 
Ptashne, ibid . (1981) 78:2199-2203, report the presence 
of two domains in a yeast promoter, with a region up- 
stream from the TATA sequence providing an activation 
site. Kramer et al., ibid . (1984) 81:367-370, describe 
the regulated expression of a human interferon gene in 
yeast employing the yeast acid phosphatase promoter, 
where expression is induced by phosphate or a tempera- 
ture shift. Tekamp-Olson et al . , Cold Spring Harbor 
Meeting, Molecular Biology of Yeast , 1983, describe the 
absence of deleterious effects on yeast viability when 
employing 11 short" promoters, as distinct from the pres- 
ence of such effects, when employing an extended promo- 
ter region or "long" promoters. 

SUMMAR Y OF THF INVENTION 

Novel hybrid promoter regions are provided 
for use in conjunction with constructs having a struc- 
tural gene under the transcriptional control of the 
hybrid promoter region and a terminator region. The 
hybrid promoter comprises a first segment providing 
transcriptional enhancement, either constitutive or 
regulated, and a second segment defining an RNA polymer- 
ase binding site and transcriptional initiation site. 
The hybrid promoter regions lack the deleterious effects 
associated with a wild-type promoter region in recombi- 
nant constructs, which results in reduced transformation 
efficiencies and lower yeast viability. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagrammatic view of plasmids 

GAP1-6; 

Figure 2 is a diagrammatic view of plasmids 
Pykl-6; and 

Figure 3 indicates the DNA linker sequence 
and a flow diagram showing its use in a construct for 
hSOD. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Novel DNA fragments are provided as well as 
constructions employing the fragments for enhanced ex- 
' pression of structural genes in a yeast host. The con- 
structs employing a hybrid promoter region provide for 
enhanced efficiencies of transformation and greatly 
improved viability of the yeast host as contrasted with 
those employing a wild- type yeast promoter. Concomitant 
with the improved viability is increased expression of 
a structural gene, in comparison with the truncated 
promoter region, and, therefore, greatly enhanced over- 
all yields of expression products. 

For the purposes of the subject invention, 
the "promoter region" is divided into two domains: 
(1) the structural gene proximal region, which includes 
the transcription initiation site, the "TATA 11 sequence 
capping sequence, as appropriate, and an RNA polymerase 
binding sequence, which sequence intends a sequence 
which includes nucleotides upstream from the initiation 
site for directing the initiation of synthesis of the 
messenger RNA; and (2) a distal region which provides 
for regulated or constitutive expression, with enhanced 

* 

expression as compared to the first domain linked to 
non- functional yeast DNA. 

The hybrid promoters of the subject invention 
employ the RNA polymerase binding region of a yeast 
glycolytic enzyme promoter and a region upstream from 
said polymerase binding region, which is different from 
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the wild-type upstream region of the SNA polymerase 
binding region and provides for enhanced efficiencies 
of transcriptidn. This distal region will be derived 
from either a sequence, usually a yeast sequence, in- 
volved in regulation of transcription, or a prokaryotic 
sequence which provides for enhanced constitutive ex- 
pression of the desired gene. 

Conveniently, cassettes or constructs can be 
prepared which provide for one or more restriction sites 
intermediate the promoter region and a related terminator 
region where the structural gene may be inserted, so as 
to be under the transcriptional control of the hybrid 
promoter region. By having one or more restriction 
sites, one can provide for ease of insertion of the 




tion and termination regions. The cassettes which can 
be prepared comprising the transcriptional initiation 
and termination region, having appropriate restriction 
sites for structural gene insertion can be cloned in 
prokaryotic vectors, so that after insertion of the 
structural gene, the resulting cassette, including the 
structural gene, may be cloned, isolated and purified, 
before introduction into a yeast vector. 

The cassette, will for the most part, have 

the following formula: 

-(P.R.(2)-P.R.(1))-R.S.-T.R.- 

wherein : 

P.R.(l) is the promoter region proximal to 
the structural gene and having the transcription initia- 
tion site, the RNA polymerase binding site, and includ- 
ing the TATA box, the CAAT sequence, as well as trans- 
lational regulatory signals, e.g., capping sequence, as 

appropriate; 

P.R.(2) is the promoter region joined to the 

5'-end of P.R.(l) associated with enhancing the 
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efficiency of transcription of the RNA polymerase binding 
region; 

R.S. is a sequence having one or more restric- 
tion recognition sites, preferably at least two restric- 
5 tion recognition sites, where the sites may result upon 
restriction into blunt ends or overhangs; 

T.R. intends the termination region, which 
will include the terminator, which may be a stem and 
loop structure, and which may be associated with one or 

10 more stop codons, a polyadenylation signal sequence, if 
any, as well as any other transcriptional and trans la- 
tional termination sequences. 

P.R.(l) will generally be at least about 150bp, 
more usually at least about 20 Obp, usually not more 

15 than about 600bp, more usually not more than about 500bp, 
generally not more than about 450bp and preferably less 
than about 40 Obp; the sequence will extend in the down- 
stream direction of transcription to about bp +3, more 
usually bp -1 and may extend only to bp -20, more usual - 

20 ly to bp -10 (the numbering intends that +1 is the first 
bp of the initiation codon with which the promoter re- 
gion is associated in the wild-type host, while -1 is 
the immediately upstream bp and the integers increase 
in the direction of transcription; 

25 P.R.(l) will be derived from a strong yeast 

promoter, normally a glycolytic enzyme promoter, such 
as glycer aldehyde- 3 -phosphate dehydrogenase, pyruvate 
kinase, alcohol dehydrogenase, phosphoglucoisomerase , 
triose phosphate isomerase, phosphof rue to kinase, etc.; 

30 P.R.(2) will provide for an enhancing function 

of transcription, which enhancing function may provide 

* 

for constitutive or regulated transcription; regulators 
will be derived from regions associated with regulation 
of yeast genes, other than the natural or wild- type 
35 gene associated with the first domain in the wild-type 
or natural host, such as UDP-galactose epimerase (GAL 10 ) , 
galactokinase (GAL1), acid phosphatase (PH05), alcohol 
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dehydrogenase I and II (ADH I and II), etc. For yeast 
regulatory sequences, the domain will usually be at 
least about lOObp, more usually at least about 200bp, 
for convenience generally not exceeding about 3kbp, 
usually not exceeding about Ikbp, desirably not exceed- 
ing about 600bp. The regulatory region will generally 
begin at least about 200bp from the initiation codon, 
usually at least about 300bp and may begin at 400bp or 
farther upstream from the initiation codon. 

Regulation can be as a result of a change xn 
the chemical or physical environment of the host, such 
as a change in carbon source, e.g., glucose to galactose 
or vice versa; a change in concentration of a nutrxent, 
e g. , an inorganic nutrient such as a phosphate; or a 
change in temperature, e.g., 25'C to 3S-C. Constitutive 
transcription can be achieved employing prokaryotxc 
sequences of at least about SOObp. usually lkbp or more, 
for convenience, generally not exceeding about 5kbp; 
conveniently, the prokaryotxc sequence can be obtaxned 
from the vector in which the cassette is cloned, xllus- 
trative vectors including pBR322, lambda, Charon 4A, 

P ACYC184, pUC5, etc. 

R.S. will generally be at least 4bp, more 
usually at least 6bp, and may be lOObp or more, more 
usually being not more than about 60bp and may include 
one or more, usually not more than about 10 restriction 
sites, where such restriction sites may be illustrated 
by EcoRI, BamHI, Sail, HindHI, Alul, Aval, Tagl, Hpal, 
etc., having at least one unique restriction site for 

the construct sequences . 

T.R. is the termination region which will 
include the "necessary transcriptional and translational 
signals for termination, such as the polyadenylation 

* 

site, etc.; 

T.R. will generally be at least about lOObp, 
more usually at least 150bp, and generally less than 
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about lkbp, usually less than about 600kbp; the termina- 
tion region may be derived from any convenient yeast 
sequence, so long as the terminator balances the promo- 
ter, conveniently being derived from a glycolytic enzyme 
terminator, where the terminator may be associated with 
the same or different enzyme with which the promoter is 



Where a cassette is cloned in a bacterial 
vector, the construction will have the following formula 

10 - ( p.R.(2)-P.R.(l))-R-S--T.R.-Rep(B)-(M(B)) a - 

I (-) 

b 

■ 

Ln all the symbols have been defined previously. 



wherein 
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20 



25 



30 



Rep (B), which intends a replicon or replica- 
tion system recognized by a prokaryotic host and may be 
derived from a plasmid or phage, such as ColEl, and R 
plasmid, e.g., pRK290, lambda, e.g.. Charon 4A, Xdv, 
etc.; 

M is a marker which provides for selection of 
hosts containing the construction, where (B) intends a 
prokaryotic, e.g., bacterial, host and a intends an 
integer of from 0 to 3 , usually 1 to 2 , although addi- 
tional markers may be present, where the marker allows 
for selection of the host containing the construct as 
well as providing for selective pressure on maintaining 
hosts having the construct? the markers include biocide 
resistance, such as antibiotic resistance, toxin resis- 
tance and heavy metal resistance; providing prototrophy 
to an auxotrophic host; providing immunity; and the 
like ; 

the markers may provide for complementation 
of an auxotrophic host, e.g., his", ura", trp , leu 
genotype, resulting in prototrophy; resistance to metals 
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such as cup* genotype; resistance to antibiotics, such 
as amp r , tc r , cam r , str r , tur* genotype, etc.; 

b is o" or 1 , intending that the construction 
is either linear or circular, usually circular. 
5 The above construct can be used for insertion 

of a wide variety of structural genes, both prokaryotic 
and eukaryotic, both naturally occurring and synthetic, 
where the genes may include signal leaders for secretion, 
and the like. The genes may express enzymes, hormones, 
10 proteins from pathogens for vaccines, structural proteins, 
lymphokines, membrane surface proteins, immunoglobulins, 
blood proteins, or the like. The particular structural 
• gene which is inserted is not critical to this invention 
and any polypeptide or protein of interest may be pre- 
15 pared employing the constructions of the subject inven- 
tion. The structural genes will usually be foreign to 
the yeast host, where foreign intends different from 
wild-type yeast structural genes and from a source that 
does not normally exchange genetic information with 

20 yeast. * 

Usually, the structural gene will be at least 

about 36bp, and not more than about 20kbp, usually not 
more than about 3000bp, usually not more than about 
1500bp. included in the structural gene may be non- 
25 coding flanking regions, the 5 '-flanking region normally 
being quite short, usually less than about 30bp, while 
the 3 '-flanking region may be extended, usually not 
exceeding about 500bp. Thus, the structural gene frag- 
ment will usually include the translational stop codons 
30 for proper termination of amino acid chain extension. 

When the structural gene has been inserted 
into the cassette which is joined to a yeast replication 
system, normally including one or more markers recog- 
nized by yeast, the resulting construct will have the 
35 following formula: 
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-(P.R. (2)-P.R.(l))-gene-T.R.-Rep(Y)-(M(Y)) - 
I _l 

wherein all of the symbols have been defined previously 
except for: 

gene, which intends the structural gene, having 
its initiation codon and stop codons as appropriate; 
and 

Y, which intends that the symbol is related 

to yeast. 

Convenient yeast replication systems include 
the 2\im plasmid replication system, combination of CEN3 
and ARS1 or ARS3, or the like. The replication systems 
may be high or low copy number, depending on the effect 
of the construct on the viability of the host. While 
the indicated replication systems are those which have 
found common employment, any replication system useful 
in yeast may be employed which provides for efficient 
replication and maintenance. Often the structural gene 
will be inserted into an appropriate shuttle vector 
capable of replication and selection in either a yeast 
or bacterial host, where the resulting construction 
will have the following formula: 

-(P.R.(2)-P.R.(l))-geae-T.R.-Rep(Y)-(M(Y)) -Rep(B)-(M(B)) - 

I ! Li 

where all symbols have been defined previously. Also, 
it is, of course, understood that the cassette without 
an inserted structural gene but containing the restric- 
tion enzyme recognition sequence, R.S., may be propagated 
in yeast or contained within a shuttle vector, where 
the construction will have the following respective 
formulae: 



-(P.R. (2)-P.R.(l) )-R.S.-T.R.-Rep(Y)-(M(Y)) - 

i s * I 
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-(P.R.(2)-P.R.(D)- R.S.-T.R.-Rep(Y)-(M(T)) a -Rep(B)-(M(B)) a - 
I ■ 1 

where all symbols have been defined previously. 

The various fragments which form the cassette 
and final constructions may be joined together in accor- 
dance with conventional ways. In many cases, genes 
have been isolated and restriction mapped, as well as 
sequenced. To that extent, one can select the sequence 
of interest by restriction of the gene, employing fur- 
ther manipulation as necessary such as resection with 
Bal31, in vitro mutagenesis, primer repair, or the like, 
to provide a fragment of a desired size, including the 
desired sequence, and having the appropriate termini. 
Linkers and adapters can be used for joining sequences, 
as well as replacing lost sequences, where the restric- 
tion site is internal to the region of interest. The 
various fragments which are isolated, may be purified 
by electrophoresis, electroeluted, ligated to other 
sequences, cloned, reisolated and further manipulated. 

The use of regulatory sequences for control- 
ling transcription of the structural gene of interest 
allows for growing the host cells to high density with 
no or low levels of expression of the structural gene, 
and then inducing expression by changing the environmen- 
tal conditions, e.g., nutrient, temperature, etc. 

For example, with the GAL4 regulatory region, 
the yeast cells could be grown in rich media with a 
glycerol-lactic acid combination to high density, e.g., 
mid or late log phase, followed by switching the carbon 
source to galactose. For PH05 regulation one could 
grow the cells at high phosphate, about 1 to lOmM, and 
then decrease the phosphate concentration to about 0.1 
to 0.5mM. For temperature sensitivity, one could grow 
the cells at 25° to 37°C and then change the temperature 
as appropriate by about 5° to 20°C. The host cells 
would have the regulatory system associated with the 
regulatory region employed. 
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Various techniques will be exemplified in the 
Experimental section of this application, which tech- 
niques can be used as paradigmatic for constructions 
employing fragments from sources other than those exem- 
plified. Of particular interest, as evidenced by the 
Experimental section, will be the use of the glyceralde- 
hyde-3 -phosphate dehydrogenase promoter region for the 
RNA polymerase binding site in conjunction with regula- 
tor sequences, such as those associated with GAL4, FH05, 
ADHII or the like* In referring to the GAL4 regulatory 
region or associated sequence, the region intends the 
sequence associated with regulation of other galactose 
metabolism genes, e.g., GAL1 and GAL10, which are under 
the regulatory control of such sequence in conjunction 
with the expression product of the GAL4 gene. The FH05 
and ADHII sequences refer to regions associated with 
the FH05 and ADHII genes which provide for transcrip- 
tional regulation of the PH05 and ADHII gene, respec- 
tively . 

The following examples are offered by way of 
illustration and not by way of limitation. 

EXPERIMENTAL 

All DNA manipulations were done according to 
standard procedures. See Molecular Cloning , T. Maniatis 
et al. , Cold Spring Harbor Lab., 1982. Enzymes used in 
cloning were utilized as per the manufacturer's speci- 
fications. Enzymes were obtained either from New England 
Biolabs or Bethesda Research Laboratories. Procedures 
with these enzymes employed the supplier's directions. 
Yeast were transformed and grown using a variety of 
media including selective medium (yeast nitrogen base 
without leucine); YEPD medium, containing 1% (w/v) yeast 
extract, 2% (w/v) peptone and 2% (w/v) glucose, and 
others as appropriate and/or detailed below. In the 
case of plating medium contained 2% (w/v) agar and for 
transformation 3% top agar. Hepatitis B surface antigen 
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was determined after lysis of yeast by glass bead agita- 
tion and clarification by centr if ligation, using the 
Ausriall assay (Abbott Laboratories). Protein is deter- 
mined by the Coomassie dye binding method. 

construction of GAL r egulator containing plasmids. 

Plasmid pLGSD5 is prepared as described in 
Guarente et al., (1982) supra . The plasmid was manipu- 
lated as follows: After restriction with Xhol, the 
overhangs were filled in with the Klenow fragment of 
DNA polymerase I ( "Klenow fragment" ) , ligated with BcoRI 
linkers (GGAAXTCC) and then completely digested with 
Ecori and Sau3A to provide a 370bp fragment which was 
isolated by gel electrophoresis and included the inter - 
genie sequence between GAL1 and GAL10 genes of yeast, 
and provides for the GAL4 regulation sequence of the 

GAL1 and GAL10 genes. 

This fragment was inserted into pBR322 which 
had been completely digested with EcoRI and BamHI , fol- 
lowed by treatment with alkaline phosphatase to prevent 
oligomerization. The resulting plasmid pBRGAL4 was 
treated in two different ways. 

In the first procedure pBRGAL4 was completely 
digested with Sau3A, the overhangs filled in with the 
Klenow fragment, and the resulting blunt-ended fragment 
ligated with Sai l linkers (CGTCGACG), followed by diges- 
tion with Sai l and Xhol . The resulting 370bp fragment 
was isolated by gel electrophoresis. This fragment has 
the original 370bp yeast GAL4 regulator sequence with 

Xho l and Sai l termini. 

The second fragment was obtained by complete 

digestion of pBRGAL4 with Xhol and Sail to provide a 
Xhol -Sal I fragment which included the 370bp yeast GAL4 
regulator sequence as well as about 280bp of pBR322, 
the GAL4 sequence extending from Sau3A to Sail. 
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The two fragments were then cloned in the 
plasmid plots, plots was prepared by inserting the 
40bp polylinker fragment of the following sequence 

EcoRI BamH I Boll I Xbal Hind i I I 

r — i i r i 

5 • AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGAAGCTTCAG 
3 • GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGAAGTC 

III I 
Smal Sail PstI PvuII 



into pBR322 as an EcoRI -PvuII substitution followed by 
insertion of the trp-lac promoter (Russell and Bennett, 
Gene (1982) 20:231-245) into the PvuII site with tran- 
scription oriented toward the polylinker sequence, 
plots was completely digested with Sai l, followed by 
treatment with alkaline phosphatase and the 370bp and 
650bp fragments independently' inserted into plots to 
provide plasmids plot5GAL4/370 and plot5GAL4/650 , re- 
spectively. Each of the plasmids was then completely 
digested with BamHI and Sai l to reproduce the individual 
fragments extended by 6bp of the polylinker fragment. 
These fragments were then ligated into pCl/1, which had 
been completely digested with BamHI and Sai l followed 
by treatment with alkaline phosphatase to prevent re- 
circularization. Plasmid pCl/1 is a derivative of 
PJDB219 (Beggs, Nature (1978) 275 :104) in which the 
region corresponding to bacterial plasmid pMB9 in 
pJDB219 has been replaced by pBR322 in pCl/1. The re- 
sulting plasmids were designated pCl/lGAL4/370 and 
PC1/1GAL4/650, respectively. The BamH I - Sai l fragment 
is located in the pBR322 portion of the vector pCl/1. 

The next construction develops a hybrid pro- 
moter for expression of the Hepatitis B surface antigen 
(HBsAg or sAg), employing the RNA polymerase binding 
region of GAPDH. The previously prepared plasmid 
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pHBS56/16-3, a yeast shuttle vector containing the al- 
cohol dehydrogenase 1 (ADH1) promoter, the HBsAg gene 
and ADH terminator as a Sph I fragment, was digested 
with Sph I and the ends modified with Bam linkers. The 
Bam linkers have the sequence CGGATCCG. 

pHBS56/16-3 was prepared as follows: A 
Taq l- Bpa l fragment obtained from the HBsAg coding region 
which included 26bp of the pre-sAg region, 681bp of the 
sAg region and 128bp of the 3 1 -untranslated region, was 
linked with EcoR I linkers and cloned at the EcoR I site 
in pBR322. The EcoRI linkers have the sequence GGAATTCC* 
The plasmid pHBS5 was thus obtained. 

After digesting pHBS5 with EcoRI , the digest 
was resected with Bal3 1 and religated with EcoR I linkers 
(GGAATTCC) ♦ After digestion with EcoR I the material of 
about 800bp was isolated from a po ly aery 1 amide gel* 
This isolate was then recloned into pBR322 which had 
been digested with EcoRI and treated with alkaline phos- 
phatase* Where the resection was to the sequence CATGG, 
which included the methionine codon, the EcoR I linker 
created an Nco l site. The plasmids were screened for 
the presence of an Nco l site and one of the plasmids 
chosen for further manipulation. This plasmid, desig- 
nated pHBS5-3, was restricted with EcoRI, the EcoRI 
fragment made blunt-ended with Klenow fragment and dNTPs, 
and the blunt-ended fragment was then restricted with 
Xba l to provide an about lOObp fragment having an Xbal 
overhang and blunt end at the former EcoR I site* 

pBBS5 was then digested with Cla l, made blunt- 
ended with the Klenow fragment and dNTPs, digested with 
Xba l, followed by alkaline phosphatase treatment. The 
lOObp fragment was then inserted into the vector to 
provide the plasmid pHBS6. Upon sequencing of the 
blunt-ended ligation site, it was found that an adeno- 
sine had been lost, so as to lose the EcoR I site, where 
the sequence was now ATCGATTCCCATGG . The Clal and Ncol 
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sites were retained. The loss of the A resulted in 
pHBS6 having a single EcoR I site. 

pHBS5-3 was digested with EcoR I and the re- 
sulting EcoR I fragment having the sAg fragment isolated 
5 by gel electrophoresis and inserted into the vector 
PHBS16 (Valenzuela et al., Nature (1982) 298:347-350). 
This plasmid has the ADB1 promoter and the sAg gene in 
an EcoR I fragment in a plasmid containing the 2\xm origin, 
a Trp I gene and pBR322. The plasmid was digested with 

10 EcoR I , treated with alkaline phosphatase to prevent 

recircularization, and the EcoR I fragment from pHBS5-3 
inserted to provide pHBS16-3, where the sAg gene iso- 
' lated as a Tag l- Hpa l fragment had been modified by Bal3 1 
resection. The plasmid pHBS16-3 was digested with Sph I 

15 and Xba l to provide a fragment which had the ADH promoter 
at the Sph terminus and the 5'-end of the sAg gene. 

pHBS56 was then digested with Sph I. pHBS56 
was prepared from pCl/1 by digestion with Sph I, which 
deletes a portion of the plasmid spanning the 2pm-pBR322 

20 joint. The active portion of the ADH1 promoter region 
is contained within the Sph l- Hind lll fragment of approx- 
imately 300bp (Bennetzen et al . , J. Biol. Chem. (1982) 
257 :301). The Sph I site in the ADH promoter begins at 
position -413 and the yeast terminator sequence is con- 

25 tained within a Hind i 1 1 - Sph I fragment of about 330bp. 
In each case the Sph I site is distal to the coding re- 
gion. A 1500bp ADH1 promoter fragment terminating at 
position -9 (Hitzeman et al. , Nature (1981) 293 ; 717) 
and an approximately 450bp terminator unit from nucleo- 

30 tides 913 to 1368 in the ADH gene nucleotide sequence 
were joined at a Hind i I I site between the fragments and 
cloned into the BamHI site of the vector YEpl3 (Broach 
and Hicks, Gene (1979) 8:121) to provide pADHS. 

The HBsAg-DNA segment of pHBSS was excised by 

35 EcoRI digestion, blunt-ended with the Klenow fragment 
and joined at both ends with Hindi I I linkers, C AAGCTTG . 
After digestion with Hind i I I , the HBsAg fragment was 



• • • 



0164556 

16 

inserted into the Hindi 1 1 site of the plasmid pADH5 
which had been .digested at the Hindi 1 1 site intermediate 
the ADH1 promoter and terminator sequence. A plasmid 
with the HBsAg gene in the correct orientation as deter- 
5 mined by restriction analysis was designated pHBS22 . 
The cassette was included between two SphI restriction 
sites. PHBS22 was digested with SphI to obtain a frag- 
ment of about 1500bp and inserted into SphI digested 
pCl/1 to provide pHBS56 which was cloned in E. cpli 

10 HB101- 

PHBS56 was digested with SphI and ttal to 
provide a l.lkb fragment having the ADH terminator re- 
• gion and the 3 » -portion of the sAg gene with the SphI 
site proximal to the terminator region. The l.lkb 
15 sphl-3toal fragment was joined to the Sphl-ttal fragment 
from pHBS16-3 , which resulted in providing the complete 
sAg gene in the correct orientation between the ADH 
promoter and terminator. This Sphl-SphI fragment was 
then ligated to SphI digested pHBS56, replacing the 
20 cassette of pHBS56 to provide the plasmid pHBS56/16-3 
with the resected sAg coding region fragment. The cas- 
sette was then excised from pHBS56/16-3 by digestion 
with SphI, followed by chewing back the overhangs with 
the Klenow fragment in the presence of dNTPs, then li- 
25 gated with BamHI linkers, followed by digestion with 

BamHI to provide a 1.6kb fragment which was isolated by 
gel electrophoresis. The fragment included the ADH 
promoter region, the sAg gene and ADH terminator region, 
as described above. This fragment was inserted into 
30 the BamHI site of pBR322 to provide pPGT16-3 which was 
digested with BamHI and Xbal and the resulting l.lkb 
fragment gel isolated, where the Jtoal-BamHI fragment 
had the 3' portion of the sAg gene and the ADH termina^ 
tor region. 

35 pHBS6 was digested with Xba l and Ncol and the 

94bp fragment gel isolated to provide the 5 '-portion of 
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the sAg gene. A synthetic adapter was prepared of the 

* 

formula 

CGA 2 TA 3 ( CA ) jTAgCAjCAA 
T, AT 3 ( GT ) 3 AT 3 GT 3 GTTGTAC 

5 having TagI and Nco l termini and providing the -25 to 
-1 nucleotides of the GAPDH (GAP49) promoter and the 
initiation codon of the sAg gene. This synthetic frag- 
ment, the Ncol-Xbal fragment, and the »aI-BamHI frag- 
ment were ligated simultaneously, followed by digestion 
10 with TagI and BamHI. The resulting fragment waB then 
substituted into pBR322 linearized with Clal and BamHI, 
• followed by treatment with alkaline phosphatase. The 
resulting plasmid, which contains the -1 to -25bp of 
the GAPDH promoter region, the sAg gene, and the ADH 
15 terminator, where the Kco l restriction site is lost was 
called pHBS6LGAPsAgtADH. 

pGAFl, a plasmid prepared by insertion of a 
Hindi 1 1 fragment containing the GAPDH gene GAP49 (Holland 
and Holland, J. Biol. Chem. (1979) 254:5466-5474) in- 
20 serted in the Hindi 1 1 site of pBR322, was digested with 
Hinfl and a 500bp promoter containing fragment isolated. 
The fragment was resected with Bal31 to remove about 50 
or 90bp, followed by ligation with Hindi 1 1 linkers and 
digestion with Hind lll. pBR322 was digested with 
25 Hindlll, followed by treatment with alkaline phosphatase 
and the about 450 or 410bp fragment inserted to provide 
pGAP128 and pGAP396, respectively. 

pGAP128 was digested with Hind lll, the fragment 
made blunt-ended with the Klenow fragment and dNTPs and 
30 the resulting 450bp fragment isolated by gel electropho- 
resis. This fragment was inserted into Smal digested 
plots, which had been treated with alkaline phosphatase, 
to provide plasmid plot5pGAP128 , which contained about 
-400 to +27bp of the GAPDH promoter and coding region. 
35 Plasmid plot5pGAP396 was prepared from pGAP396 in an 
identical manner and thus differs from plasmid 
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p lot5pGAP128 in having about 15-30 fewer bp at each 
terminus of the GAPDH promoter region (about -385 to 

Plasmids GAP1-GAP4 were then prepared in the 
5 following manner. Plasmid plot5pGAP128 was digested 

with TagI and BamHI to provide an about 390bp Tagl-BamHI 
fragment which included the -26 to about -400bp of the 
GAPDH promoter region and a portion of the Hindi 1 1 and 
plots polylinker. pHBS6LGAPsAgtADH plasmid was also 
10 digested with TagI and BamHI and a 1. 1Kb Tagl-BamHI 

fragment containing the 3 '-terminus of the GAPDH promo- 
ter region, the sAg gene and the ADH terminator region 
• was gel isolated and ligated to the other Tagl-BamHI 
fragment to provide a BamHI -BamHI fragment which included 
15 approximately 400bp of the GAPDH promoter region, the 

sAg gene in proper orientation for transcriptional regu- 
lation by the GAPDH promoter, followed by the ADH termi- 
nator region. This fragment was ligated into pBR322 
which had been digested with BamHI and treated with 
20 alkaline phosphatase to provide plasmid pPGTSO. This 
BamHI cassette could now be isolated and inserted into 
plasmid pCl/1. at the BamHI site in the pBR322 portion 
of pel A* where in plasmid GAP1 the ADH terminator region 
is proximal to the amp. r gene with the pBR322 portion 
25 divided into an approximately 4kb sequence including 

the amp. r gene and a 375bp region separating the cassette 
from the 2pm sequences. In GAP2, the promoter is adja- 
cent to the long pBR322 sequence with transcription in 
the same direction as the amp r gene. The same cassette 
30 was inserted into BamHI -digested P Cl/lGAL4/650 to obtain 
plasmids GAP3 and GAP4, where GAP3 has the GAPDH promoter 
distal from the GAL4 regulator region and the long pBR322 
sequence and GAP4 has the GAPDH promoter adjacent to 
the GAL4 regulator region, which is adjacent to the 

35 long pBR322 sequence. 

Plasmids GAPS and GAP 6 were isolated as follows 
Plasmid plot5pGAP396 was digested with Sail and TagI 
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and a fragment containing 9bp of the plots polylinker 
sequence and the GAFDB promoter sequence extending from 
about -385 to -26bp was isolated. An approximately 
130bp Tagl-Xbal fragment including -25 to -lbp of the 
GAPDH promoter and +1 to +93bp of the sAg gene was ob- 
tained from pHBS6LGAPsAgtADH . A l.lkb Xbal -Sail fragment 
containing the 3 ' -portion of the sAg gene and the ADH 
terminator as well as 6bp of plots polylinker sequence 
was obtained from plasmid plot5sAgtADH (described below- 
— Pyruvate Kinase Promoter). These three fragments 
were ligated, digested with Sail and then cloned into 
Sail-digested pCl/lGAL4/370 . GAP5 has the GAPDH promo- 
• ter region adjacent to the GAL4 regulator region, which 
is proximal to the short pBR322 sequence, and GAP6 has 
} the GAPDH promoter region distal from the GAL4 regulator 
region and proximal to the long pBR322 sequence (see 
Fig. 1). 

Pyruvate kinase promoter . 

Plasmid pHBS6Pyk containing the sAg gene under 
0 the transcriptional regulatory control of the Pyk promo- 
ter was obtained by cloning a 4.4kb insert of yeast 
genomic DNA in pBR322 containing the Pyk gene and 911 
nucleotides of 5 • -untranslated region, and digestion of 
this plasmid pPyk9.1.1 with Xbal. After making the 
25 ends blunted-ended, the linear fragment was digested 
with BamHI providing a 912bp BamHI-blunt fragment con- 
taining the Pyk promoter and 8 bases from the Pyk coding 
region. This fragment was inserted into the plasmid 
pHBS6, which had been digested with Ncol, blunt-ended 
30 and digested with BamHI . The plasmid pHBS6Pyk was to- 
tally digested with EcoRI , to obtain a fragment includ- 
ing the sAg gene and a portion of the Pyk promoter re- 
gion. The fragment was made blunt-ended with the Klenow 
fragment and dNTPs, followed by ligation to BamHI linker? 
35 digested with Xbal, which is internal to the sAg gene, 
the Xbal terminus made blunt-ended with the Klenow 
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fragment and dMTPs, followed by digestion with BamHI, 
to provide a 580bp BamHI -blunt-ended (ttal) fragment 
The plasmid plots was digested with EcoRI, made blunt- 
ended, digested with BamHI and treated with alkaline 
phosphatase and the two fragments joined to provide 
plasmids plotSPyksAgSl and plotSPyksAg . 57 . The two 
differ in that the BainHI site of the latter was not 
regenerated during cloning, possibly as a consequence 
of minimal nuclease contamination (digestion) . 

plots was treated as previously described 
(EcoRI digestion, blunt-ended, BamHI digestion and treat- 
n^t with alkaline phosphatase) and joined to a l.lkb 
fragment obtained by digestion of pPGT16-3 with Xbal. 
followed by blunt ending, followed by digestion with 
BamHI and gel isolation. This fragment was introduced 
So plots to provide the plasmid plots sAgtADH. Again 
the BamHI site in this plasmid was not regenerated, 
presumably due to digestion by contaminating nuclease. 

Plasmids Pykl and Pyk2 were prepared as fol- 
lows Plasmid plotSPyksAgSl was digested with BamHI , 
then with Jtoal, and an approximately 580bp fragment 
containing about 480bp of Pyk promoter and 93bp of the 
5 ,_end of the sAg gene was gel isolated. A l.lkb Jtoal- 
sall fragment containing the 3 '-portion of the sAg gene, 
^ am terminator and about 6bp of the plots polylinker 
was isolated from plotSAgtADH. These two fragments 
were ligated, digested with BamHI and Sail and then 
cloned into plasmid pCl/1, which had been cleaved with 
BamHI and Sail and treated with alkaline phosphatase, 
^Tyield plasmid Pykl. Plasmid Pyk2 was prepared simi- 
larly but the 580bp Sall-Jfcal, Pyk promoter/HBsAg gene 
5'-end fusion fragment was isolated from plot5PyksAg.57 
and included about 6bp of plots polylinker sequence 
upstream from the promoter region. Also the l.lkb fifl- 
BamHI fragment containing the 3 ■ -part of the HBsAg gene 
Ind the ADH terminator was derived from plasmid pPGT16-3 

Plasmids Pyk3-Pyk6 were prepared as follows. 
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Plasmid plot5PyksAg51 was digested with BamHI, then 
with Xbal and the about S80bp fragment containing the 
pyk promoter and the 5 '-part of the HBsAg gene isolated 
as above. The l.lkb BamHI -Xbal fragment, containing 
5 the 3' -portion of the HBsAg gene and ADH terminator, 
was recovered from pPGT16-3, also as above, and the two 
fragments ligated, digested with BamHI and inserted 
with different orientations into the BamHI site of 
PC1AGAL4/650 (Pyk3, Pyk4). Plasmids Pyk5 and Pyk6 
10 were prepared similarly except that the Sall-Xbal frag- 
ment containing the Pyk promoter and 5* -end of the sAg 
gene was isolated from plot5PyksAg.57 and the Xbal -Sal I 
• sAg gene 3 • -portion/ADH terminator fusion fragment was 
derived from plotSsAgtADH and thus both fragments in- 
15 eluded approximately 6bp of plots polylinker sequence. 
The cassette so formed was then cloned into the Sail 
site of pCl/lGAL4/370 in opposite orientations. 

The six plasmids designated Pykl-6 (see Fig. 
2) are distinguished by Pykl having the promoter region 
20 proximal to the short pBR322 sequence; Pyk2 having the 
promoter region proximal to the long pBR322 sequence? 
Pyk3 having the promoter region proximal to the short 
pBR322 sequence and distal from the GAL4 sequence; while 
Pyk4 has the promoter region proximal to the GAL4 region, 
25 which in turn is proximal to the long pBR322 sequence; 
Pyk5 has the promoter region proximal to the GAL4 region 
which is proximal to the short pBR322 sequence; while 
Pyk6 has a promoter region distal from the GAL4 region 
and proximal to the long pBR322 sequence. 
30 These plasmids described above were trans- 

formed into S. carlsbergensis strain 2150-2-3 (available 
from Lee Hartwell, University of Washington) under con- 
ventional conditions (Hinnen et al. f Proc. Nat l* Acad. 
Sci. USA (1978) 75:1929-1933). Cultures of 50-150ml 
35 were grown to mid or late log phase in rich media (YEP) 
under neutral conditions (3% glycerol, 2% lactic acid), 
and then inducing conditions (+2% galactose), or 
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repressing conditions (+2% glucose) for the final 1-2 
generations. After lysis with glass beads and clarifi- 
cation of the supernatants by centrifugation, HBsAg 
expression was determined as described above. The re- 
sults for the 12 plasmids are set forth in the following 
Table 1. 
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Construction of pPGAP , 

A yeast expression vector was prepared called 
pPGAP having a polyrestriction site linker between the 
GAPDH terminator and short promoter region. Plasmid 
5 plot5pGAP128 was digested with BamHI and Taq I to yield 
an approximately 390bp BamH I " Taq I fragment having the 
-400 to -26bp of the GAPDH promoter. The BamH I - Taq I 
fragment was ligated to a synthetic fragment having the 
following sequence: 

10 CGAjTAg ( CA ) gTAgCAgC^CCATGgAjTjCG^AC^ 

T 2 AT 3 (GT) 3 AT 3 GT 3 GTGGa»C 3 T 2 A 2 GCA 2 TC 2 AGCT 

to provide a BamHI -Sal I fragment , which was digested 
with BamHI and Sai l and used to replace the BamHI -Sal I 
fragment of BamHI -Sal I digested pBR322 treated with 

15 alkaline phosphatase. After ligation, the plasmid 

pGAFNRS was obtained which was digested with BamH I and 
Sai l to provide a 400bp BamHI -Sail I fragment which was 
gel isolated. This fragment was ligated to an about 
900bp Sal I - BamH I fragment containing the GAPDH termina- 

20 tor region and a short segment of 3' coding region and 
the resulting 1 .4kb BamH I - BamH I fragment digested with 
BamH I. The Sal I - BamH I GAPDH terminator fragment was 
obtained by Sai l and BamH I digestion of pGAP2, a plasmid 
prepared by insertion of an about 3.3kb BamH I fragment 

25 containing the GAPDH gene GAP49 (Holland and Holland, 
supra ) into the BamH I site of pBR322. Plasmids pGAP2 
and pGAPl were obtained as follows: A yeast gene li- 
brary was prepared by inserting fragments obtained after 
partial digestion of total yeast DMA with restriction 

30 endonuclease Sau3 A in lambda -phage Charon 28 (Blattner 
et al. , Science (1977) 196:161-169). The phage library 
was screened with DNA complementary to the yeast GAPDH 
mRNA and the yeast GAPDH gene from one of these clones 
was subcloned as either an about 3.3kb BamH I fragment 

35 in the BamH I site of pBR322 (pGAP-2) or as an about 
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2.1kb Hindi 1 1 fragment in the Hindi 1 1 site of pBR322 
(pGAP-1). 

pBR322 was digested with EcoR I and Sai l, the 
termini blunt-ended and ligated to BamH I linkers, fol- 
lowed by BamH I digestion and the BamH I - BamH I 3. 8Kb frag- 
ment gel isolated, reci regularized by self-ligation, 
cloned and designated pBRARl-Sal. The 1.4kb BamH I - BamH I 
fragment was inserted into the BamH I -digested , alkaline 
phosphatase treated pBRARl-Sal vector to provide the 
plasmid pFGAP of about 5.3kb with the orientation in 
the opposite direction of the amp r . 

The plasmid phSOD was prepared as follows: 

Molecular cloning of hSQD cDNA . 

Total RNA was prepared from an adult human 
liver by the guanidinium thiocyanate/lithium chloride 
method (Cathala et al., DNA (1983) 2:329-335). polyA 
RNA was used to synthesize double-stranded cDNA (Haniatis 
et al. , Molecular Cloning , 213-242, Cold Spring Harbor, 
1982) and this was passed over a Sepharose CL4B column 
to enrich for cDNAs of greater than 350bp (Fiddes and 
Goodman, Nature (1979) 281 :351-356) . The cDNA was in- 
serted at the Pst I site of plot 4, a pBR322 derivative 
having the following sequence replacing the Pst I - EcoR I 
site. 

PstI Hinfl Alu l 
1 6GTGAATCCGTAATCAT66TCATA6CT6TTTCCT6TGTGAAATTGTTATCC6CTC 
ACGTCCACTTAGGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAG 
SEhl 

Hind I I Alul 

60 ACAATTCCACACATTATACGAGCCGATGATTAATTGTCAACAGCTCATTTCAGAATATTT 
TGTTAAGGTGTGTAATATGCTCGGCTACTAATTAACAGTTGTCGAGTAAAGTCTTATAAA 

EcoRI 

120 GCCAGAACCGTTATGATGCGG 

CGGTCTTGGCAATACTACGCCTTAA 
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The cDNA insertion employed the oligo-dG:dC tailing 
method (Haniatis et al., supra ). B. coli strain D1210 
was transformed with this mixture and trans formants 
selected on L-agar containing 10ug/ml tetracycline 
(Kushner, S.R. (1978) In: Genetic Engineering, eds. 
Boyer, H.B. and Nicosia, S., (Elsevier/North Holland, 
Amsterdam) p. 17). Plasmid DMA constituting a liver 
cDNA library was prepared (Haniatis et al . , Molecular 
cloning , pp. 86-94, Cold Spring Harbor 1982) directly 
from approximately 62,000 recombinant colonies plated 
at a density of approximately 3,000 colonies per 9cm 
diameter Petri dish. 

Isolation of r-hSOD clones . 

Strain D1210 was retransformed with the liver 
cDNA library and about 40,000 clones were grown on nine 
14cm diameter Petri dishes. After transfer of the col- 
onies to nitrocellulose paper and chloramphenicol ampli- 
fication of plasmid SNA, the cells were lysed and the 
filters prepared for hybridization ( I sh-Horowicz and 
Burke, Nucleic Acids Research (1981) 9:2989-2998). 
Oligonucleotide probes were employed for screening by 
hybridization, with the probes consisting of enzymatic- 
ally-radiolabeled, chemically-synthesized DNA molecules 
complementary to the mRNA encoding amino acid residues 
19 to 24 of the protein (Jabusch et al. , supra . ; Barra 
et al. , supra . ) ; the mixture had the following sequences : 

3' TTA AAA CTT GTT TTT CT 5 1 
G G C C C 

where all of the indicated possibilities for encoding 
the peptide sequence were prepared (32-fold degenerate). 

The probes were labeled with P to a specific 
activity of l-3xl0 8 cpm/pg and Millipore (0.45(jm) filtered 
before use. Filters were prehybridized for 6hrs at 
30°C in 4x SSC, 2x Denhardts»s solution, 40mM sodium 
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phosphate, pH 7.5, 300pg/ml sonicated salmon testes 
DNA. Hybridization was for 20hrs at 30°C in the same 
solution containing 2xl0 6 cpm/ml hSQD DNA probe (residues 
19-24). Filters were washed in 4x SSC, once for 15m in 
5 at r.t. and twice for 15min at 30°C, blotted dry and 
autoradiographed with an intensifying screen for 24hrs 
at -70°C. 

Areas on the master plates that corresponded 
to duplicate positive signals were picked into L-broth 

10 and plasmid DNA prepared by the miniscreen procedure 

(Mania tig et al.. Molecular Cloning , 178, 368-369, Cold 
Spring Harbor 1982). This DNA was cut with Pst I and 
' subjected to Southern blot analysis (Southern, J. Hoi. 
Biol . (1975) 98:503-517) hybridizing initially with the 

15 previous labeled probes (amino acid residues 19-24) and 
then with additional radiolabeled probes derived from 
amino acid residues 109-114 and having the following 
sequences (all possible variations, 72-fold degenerate) 
present as a mixture: 

20 3' CTA GTA ACA TAA TAA CC 5' 

G G G G G 

T T 



One plasmid pool (pSODl) contained a cDNA insert of 
520bp that hybridized with both probes and after colony 

25 purification, plasmid DNA was prepared from this clone 
and sequenced by the method of Maxam and Gilbert ( Proc. 
Natl. Acad, Sci. USA (1977) 74:560-564). The hSOD cDNA 
clone pSODl constitutes the coding region for amino 
acids 10-153 of hSOD, a single translational stop codon 

30 and a 3 1 untranslated region. Therefore, in the expres- 
sion vector construct, the base sequence of the region 
encoding amino acids 1-9 is derived from the published 
amino acid sequence of hSOD (Jabusch et al . # supra ; 
Barra et al . , supra ) and synthesized chemically as a 
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part of the variable linker segment (see discussion 
relating to Fig. 3). 

rv^+^tion »lotS d«ri «rt*v— containing r-hSOD . 

The synthetic DNA molecules F(26), C(16), 
B(31), D(ll), E(13) and 4(24) shown in Fig. 3, were 
synthesized by the phosphoramidite method. 

The single strand 4(24) was prepared by usxng 
all four bases, at each site where X is indicated. 
Furthermore, silica was withdrawn from the synthesis of 
the 24mer, such that single-stranded 21mers, 22mers, 
and 23mers are obtained in addition to the 24mers. 
After removal from the silica support, the four mixtures 
are combined in appropriate proportions to provide for 
equimolar amounts of each of the possible single strands . 
This mixture was treated as a single product in the 

subsequent steps. 

Molecules F(26), C(16), B(31) and D(ll) were 
mixed together in equimolar amounts and lOug phosphory- 
lated using T4 polynucleotide kinase. After phenol-ether 
extraction, the additional non-phosphorylated synthetic 
DNA molecules 4(24) and E(13) were added, such that all 
fragments were equimolar. The equimolar mixture con- 
tained 13ug of DNA in 133ul of 0.3x kinase buffer. 

After annealing by cooling at a uniform rate 
from 70°C to 20°C over 60min, the single strands were 
ligated together with T4 ligase in 200ul ligation mix 
at 14°C for 4hrs, phenol-chloroform extracted, ethanol 
precipitated and the 5 '-ends of 4(24) and E(13) phospho- 
rylated using T4 polynucleotide kinase (Maniatis et 
al . , supra ) . Preparative polyacryl amide gel electro- 
phoresis was used to isolate the completely ligated 
53bp material having 5'- and 3 • -overhangs . 

The above purified fragment mixture was then 
ligated to the 460bp Tagl-Pstl segment of the hSOD cDNA 
as shown in Fig. 3. This segment was itself constructed 
by isolating the 454bp Tasl-Alul hSOD fragment, making 
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it flush-ended using Klenow and inserting it into plots 
between its EcoRI and Sai l sites which had been simi- 
larly made flush-ended. After preparation of plasmid 
DNA from this recombinant, the 460bp Taq l- Pst I hSOD 
5 fragment was isolated by preparative poly acryl amide gel 
electrophoresis. After extraction and precipitation , 
the 515bp fragment resulting from the joining of the 
synthetic fragment to the 460bp Taql-PstI hSOD fragment 
was blunt-ended (525-528bp) and then digested with Sai l 

10 and the resulting 519-522bp hSOD fragment isolated by 
poly acryl amide gel electrophoresis. This fragment was 
then inserted into plots which had been digested with 
• PvuII and Sail and then treated with alkaline phospha- 
tase. The resulting plasmids were used to transform 

15 strain D1210. Recombinants obtained after transforma- 
tion of strain D1210 were selected on L-agar containing 
lOOpg/ml ampicillin to give a set of clones, which were 
screened for an Hco l site. One was selected and desig- 
nated phSQD. 

20 Construction of a yeast vector for SOD expression * 

The plasmid phSOO was digested with Nco l and 
Sai l and a 550bp fragment obtained, which included 1 
nucleotide untranslated at the 5 '-terminus and the entire 
coding region for hSOD. pPGAP was digested with Hco l 

25 and Sai l followed by treatment with alkaline phosphatase 
and the Sal I -Ncol fragment substituted for the Ncol -Sal I 
fragment in pPGAP to provide pPGAPSOD. BamH I digestion 
of pPGAPSOD resulted in a 2kb fragment which was gel 
isolated and inserted into the BamH I site of pCl/1 and 

30 pCl/1 GAL4/370. These plasmids were transformed into 

* 

yeast strain 2150-2-3 as described previously, with the 
results of expression set forth in the following Table 
2. 
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Table 2: Expression of Human SOD in Yeast Strain 2150 



Plasmid 


Carbon Source 


SOD 2 


5 




Mg/mg protein 


pCl/1 


g. i 1 


0 


pCl/lGAFSQD 


g» L 


148 


pCl/lGALGAPSOD 


g> L 


0.4 


10 


gal 


68 



1 All cultures grown in Minus Leucine media with 2% lactic acid, 
3% glycerol with or without 2% galactose to late log or early 

15 .stationary phase. 

2 Determined by RIA. 



hSOD levels were measured using a standard radioimmuno- 
assay with iodinated authentic hSGD as standard. Con* 
20 stitutive synthesis from the GAP promoter leads to very 
high levels of hSGD production, of the order of 10-30% 
of the total cell protein. The induction with galactose 
works almost as well, yielding about 7% of the cell 
protein as hSOD. 

25 Construction of pYASIl 

This yeast expression plasmid contains the 
hSOD gene fused to the amino terminus of the human pro- 
insulin gene, with a methionine codon at the junction 
between both genes. The fusion gene is under control 

30 of the hybrid inducible ADH2-GAF (yeast alcohol dehydro- 
genase 2) promoter and the GAP terminator. An about 
3kbp BamHI expression cassette was constructed. 

The ADH2 portion of the promoter was con- 
structed by cutting a plasmid containing the wild type 

35 ADH2 gene (plasmid pADR2, see Beier and Young, Nature 

(1982) 300:724-728), a pBR322 derivative, which contains 
the yeast centromere gene from chromosome 3 (CEN3) (U.S. 
Patent No. 4,464,472), the Trp l gene and the yeast ADHII 
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gene with the restriction enzyme EcoRS, which cuts at a 
position +66 relative to the ATG start codon, as well 
as in two other sites in pADR2, outside of the ADH2 
region in the CEN3 and Trp l gene. The resulting mixture 
5 of a vector fragment and two smaller fragments was re- 
sected with Bal3 1 exonuclease to remove about 300bp. 
Synthetic Xho l linkers were ligated onto the Bal31 
treated DNA. The resulting DMA linker vector fragment 
was separated from the linkers by column chromatography, 
10 cut with the restriction enzyme Xho l, religated and 
used to transform E. coli to ampicillin resistance. 
The positions of the Xho l linker additions were deter- 
' mined by DMA sequencing. One plasmid which contained 
an Xho l linker located within the 5 1 non- transcribed 
15 region of the ADH2 gene (position -232 from ATG) was 
cut with the restriction enzyme Xho l, treated with nu- 
clease SI, and subsequently treated with the restriction 
enzyme EcoR I to create a linear vector molecule having 
one blunt end at the site of the Xhor linker and an 
20 EcoR I end at the pBR322 proximal terminus. 

The GAP portion of the promoter was constructed 
by cutting plasmid pPGAF (supra) with the enzymes BamH I 
and EcoR I , followed by the isolation of the 0.4Kbp DMA 
fragment. The purified fragment was cut with the enzyme 
25 Alu l to create a blunt end near the BamHI site. 

Plasmid pJS014 was constructed by the ligation 
of the Alu l - EcoR I GAP promoter fragment to the ADH2 
fragment present on the linear vector described above. 

Plasmid pJS104 was digested with BamH I (which 
30 cuts upstream of the ADH2 region) and with Nco l (which 
cuts downstream of the GAP region). The about 1.3Kbp 
fragment containing the ADH2-GAP promoter was gel puri- 
fied and ligated to an about 1.7Kbp fragment containing 
the hSOD-proinsulin fusion DNA sequences and GAP termi- 
35 nator present in pYSIl. This 3Kbp expression cassette 
was cloned into BamH I digested and phosphatase treated 
pCl/1 to yield pYASIl. 
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Plasmid pYSIl was constructed as follows. 

„ fragments were employed which involve a 454bp 

NcoI-Sau3A isolated from phSOD, where the fragment in- 
cudes the entire coding sequence for human superoxide 
dismutase (hSOD) with the exception of the last three 
3.- codons; a 51bp Sau3 A-Hindl 1 1 synthetic adapter, 
which codes for the last three codons of hSOD, methio- 
nine, and the first 14 codons of proinsulin; and a 231hp 
Hindlll-Sall fragment, isolated from pINS5 (a pBR322- 
lerived vector which contains a proinsulin coding se- 
quence chemically synthesized according to the amino 
acid sequence reported by Bell et al.. Nature (1979) 
282:525-527), which encodes proinsulin excepting the 
first 14 amino acids. These fragments were ligated 
together and introduced into the plasmid pPGAP, which 
had been previously digested with Ncol and Sail and 
alkaline phosphatase treated. The resulting plasmid 
pSU was digested with BamHI to provide an expression 
cassette which was cloned into plasmid pCl/l to yield 

pYSIl. 

pinnin g of alpha- 1- antitrypsin. + 

A cDNA library was made from lOug of polyA 
SNA isolated from a part of a human liver. This library 
was prepared by oligo-dl priming of the first cDNA strand 
and self-priming of the second cDNA strand. The ds 
cDNA was size fractionated on a Sepharose CL4B column 
and those molecules greater than 300bp isolated. This 
fraction was treated with nuclease SI and tailed with 
dCTP, using terminal transferase- The tailed cDNA was 
annealed to pBR322 which had been digested with Pstl 
and tailed with dGTP. Transformation of E. coli HB101 
yielded 60,000 colonies, where greater than 90% of the 

clones were recombinant. 

Two synthetic oligonucleotide probes were 
used to isolate the alpha-l-antitrypsin ( 0] -AT) cDNA, 
the first probe corresponding to amino acid residues 
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344-350 near the C- terminus of the protein was used to 
probe 5,000 colonies and the second probe, corresponding 
to amino acid residues -23 to -17 (+1 being the first 
nucleotide of the first codon of the mature fl^-AT) of 
the signal peptide, was used to probe 25,000 colonies. 
The probe sequences were taken from the partial nucleo- 
tide sequence described by Kurachi et al . , Proc. Natl. 
Acad. Sci. USA (1981) 78:6826; Leicht et al. , Nature 
(1982) 297:655). Approximately 3% of the colonies hy- 
bridized to the c-terminal probe and four hybridized to 
the M- terminal probe. The four N- terminal clones and 
12 C-terminal clones were isolated and subjected to 
' restriction analysis. From these, three overlapping 
clones which cover the entire cDNA were subjected to 
further study and were used to construct the full length 
CDMA clone. 

The entire sequence of a composite full length 
cDMA derived from the three plasmids is as follows: 
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Met Pro Ser Ser 
GGGGGGGCGGACGGTAATCUACA ATG CCC TCT TCT 



-1 
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61 

41 
121 

61 
181 

81 
241 
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301 

121 
361 

141 
421 
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431 

181 
5*1 



201 
601 



221 
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901 
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61 
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Met Serine 
ATG TCTlATC 
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ao« n,, r v+ r.iv Thr Glu Ala Ala Gly Ala Met Phe Leu Glu Ala He Pro 
gSc cic AAA Gc£ ACT GAA OCT OCT GGG GCC ATG TTT TTA GAG GCC ATA CCC 

pi*. t wa Phe Aan Lvs Pro Phe Val Phe Leu Met lie Glu Gin Asn Thr Lya 
CCC CCC GAG CTC aXg TTC AAC ffl CCC TTT GTC TTC TTA ATG ATT GAA CAA AAT ACC Aic 
— — Aval 

?S EE? CTC ??C I?J I'd iS So & U? CS°C IS III kl! ?Sa CTCCCTCTCCCTCCTCAU 

"* ~* ~ — ™ Hint 1 

AAT CCC ACC CAA AAA TAG 

GGG TGG GTT TTT ATC AGCT 

Salt 

ccctcccctccatccctgcccccctccctcgatgacattaaagaagggttgacctgcaaaaaaaaaaaaaaaaaaaaa; 



35 



0164556 



LEGEND 

Nucleotide and predicted amino acid sequences 
of a., -AT CDNA. The reactive center met-ser at 
positions 358-359 is boxed. Subscripts to 
amino acids in parentheses identify differences 
between the subject protein sequence and those 
derived from (a) protein sequencing (Carrell 
et al., 1982), (b) the cDNA of Woo et al . , 
f£ee"Carrell et al., 1982]), and (c) the cDNA 
of Bollen et al . , 1983 . The synthetic DNA 
molecules used in the construction of the 
BamHI to Sai l fragment encoding the mature 
protein are shown as are the cDNA restriction 
sites used in this construction. 

The above sequence was determined using the 
dideoxy sequencing method of Sanger et al. r Proc. Natl. 
Acad. Sci. USA (1977) 74:5463, in the M13 vectors of 
Messing et al., Nucleic Acids Res. (1981) 9:309. The 
differences at the nucleotide and amino acid level from 
the published cDKA sequences are shown. 

Construction of the full length clone for 
expression of yeast began with three fragments isolated 
from cDNA clones: 1) a 630bp BamHI-BstEII fragment; 2) 
a 450bp BstEII- Ava l fragment; and 3) an 85bp Aval-Hinfl 
fragment. A synthetic adapter was employed having the 
following sequence: 

A 2 TC 3 AC 3 A 5 TAG 

GgTGgTgATCAGCT 

Approximately two pmoles of fragments 1 and 2 were li- 
gated together and after removal of the ligase, digested 
with BamHI and Ava l, Fragment 3 and the synthetic adap- 
ter were ligated and digested with Aval and Sail and 
the two resulting fragment mixtures were ligated followed 
by digestion with BamHI and Sail. Fragments migrating 
upon electrophoresis in the region of about 1000-1400bp 
were isolated and cloned by substitution into BamH I and 
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sail digested and alkaline pho.ph.tM. treated pB*322. 
^ oiasmid is referred to as pATi. 

The "^"^arid PPCAP was digested with Kcol . followed 
h„ Mim t-ending. followed by Sail digestion and treat. 
m L t TiL af,»Une pno.ph.tase. The weoX-S^ fragment 
^substituted wit. an approximately^* fended 
<BamBI)-salI fragment obtained from plasmid pAXi. by 

diction, blunt ending, and Sail digestion. 
Jii was inserts into the pPGAP vector to Produce the 
plasmid pGAPATi. a 6.6W. plasmid. which was digested 
with Hcol and gsmHI and a 2.3kb HeoI-BjfflHI fragment 
witn »=_ 7 md tj,, OAPDH terminator 

obtained having the Oj-ax gene ana 

AOObp BamHI-NcoI fragment obtainea 




having the GAPDH promoter. These fragment 
5 together and inserted into the BamHI site of PO^- 

Tn* plasmid, pCl/lG»ATi8 and P cl/lG»ATi9 war. obtained 
with the orientation of expression clockwise in th.^ 
former and counterclockwise in the latter. with he 
Ling in the counterclockwise direction. ^ 
,0 were ^transformed in s. cerevisiae AB103 (A.T.C.C. No. 
^688, depoaited January 5. 1983, by standard methods 
selecting for leucine prototrophy and 
above. «.st extracts were prepared by lysis with gl 
beads and the a^-AT activity determined by inhibition 

25 of human leukocyte elaBtase. . q 

asmvs contained in 1ml: 0.1-0. 2 human leukocyte 
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elastase (SLE); .O.lmM »^^ U ^!^^ y 
anilide (Beatty et al., t irtol. chem. (1980) 25S.3931), 
T^ TJs. p= H.C1. and the indicated amounts 

of veast extract or human «,-AT. Assays were initiated 
^ £ edition of elastase* incubated at 
terminated by the addition of 100,1 of 8H acetic acid 
and the absorbance at 410n» determined. Typical results 
are shown in the following Table 3. 
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The above data demonstrate that plasmids having the 
orientation of the expression cassette in the counter- 
clockwise orientation, the promoter proximal' to the 
long sequence of pBR322, make 10-20 times more a^AT 
than the same cassette in the other orientation. 

ypast strain AB110 . 

Yeast strain 2150^2-3 was crossed with a yeast 

strain AB103 transformant containing P Cl/lGAPATi9 . The 
diploids were sporulated and the tetrads dissected. 
Strains were maintained on leucine selective plates in 
order to ensure maintenance of the plasmid. since the 
parents are auxotrophs. A series of colonies were 
screened for their genotype with respect to a number of 
markers. The most vigorous strains were selected and 
cultures grown on leucine selective media. The best 
strain was designated AB110 < P Cl/lGAPATi9 ) , gave 6-7.5% 
of the total cell protein as ^-AT as shown in the above 
Table 3. The strain AB110 has the following genotype: 
Mata, ura3-52, leu2-04 or both leu2-3 and leu2-112, 
pep4-3, his4-580 (cir°). 

Phosphate induction . 

Plasmid pPGT80 was digested with BamHI , the 
ends blunt-ended, followed by digestion with ttal and 
the 500bp fragment containing the GAPDH promoter and 
5 '-end of the sAg gene isolated. 

The PH05 gene was isolated from a yeast genomic 
library employing an oligonucleotide probe 
5 1 -GGCACTCACACGTGGGACTAG-3 ' derived from the published 
partial sequence (Meyhack et al . , The EMBO Journal (1982) 
1:675-680). A subfragment of this clone containing 
550bp of the 5 • -untranslated region and approximately 
80bp of coding sequence was subcloned as a BamHI -Sal I 
substitution in P BR322 to provide pPHOS. This fragment 
has the sequence 5 1 -ATGTTTAAA-3 ' , encoding the first 
three amino acids, the second and third codons 
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specifying an Ahalll site. The plasmid pHBS6 was digested 
with Nco l, blunt-ended, followed by digestion with BamHI 
and treatment with alkaline phosphatase. The PH05 pro- 
moter region was obtained by digesting the pPH05 plasmid 
with Ahalll, resecting the resulting fragment with Bal31 
for a short time, followed by digestion with BamHI and 
isolation of a 500-550bp BamHI blunt-ended fragment. 
This fragment was employed for substitution of the 
Nco l - BamH I fragment from pHBS6 and was screened for 
regeneration of the Nco l restriction site to provide 

plasmid pHBS6PH05/l. 

Plasmid pHBS6PH05/l was digested with BstEII 
* which cleaves at position -175 in the PH05 promoter. 
This molecule was blunt-ended, digested with Sai l and 
the 650bp fragment having the 5' -portion of the promoter 
domain, containing 275bp of pBR322 and 375bp of the 
PH05 promoter region isolated. This fragment was li- 
gated with the blunt-ended (BamHI) -Xbal fragment ob- 
tained from digestion of pPGT80 with BamHI/ blunt end- 
ing, followed by Xbal digestion. After digesting the 
ligated fragment with Sai l and Xba l, the resulting frag- 
ment was then substituted into pPGT16-3 which had been 
digested with Sai l and Xba l and treated with alkaline 
phosphatase. The resulting plasmid pPHO5PGT80 had a 
cassette comprising the PHOS regulatory region, the 
GAPDH promoter, the sAg gene and the ADH terminator. 
This cassette was excised from the plasmid by BamH I 
digestion, whereby a l.Skb BamH I - BamH I fragment was gel 
isolated and ligated into the BamHI site of BamH I di- 
gested and alkaline phosphatase treated pCl/1 to provide 
plasmids PH05GAP1 and PH05GAP2 where the PHOS was distal 
and proximal to the long pBR322 sequence, respectively. 

The two plasmids were transformed into yeast 
strain 2150-2-3 as described above and grown in rich 
media as described above for 8 to 10 generations in 
either high (7mM) or low(0.2mM) phosphate. Samples 
were harvested in late log phase and HBsAg determined 
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described previously. The 
in Table 4. 



0164556 



results are shown below 



Table 4- Regulation of BBsAg Production in Yeast using a 
Hybrid PH05/GAPDH Promoter. 



10 



15 



20 




Construction 



High Phosphate Low Phosphate 
(7mH) (0.2mM) 
(sAg ug/mg protein) 



Induction 
low / high 



PH05GAP-1 
PH05GAP-2 



0.08 
0.27 



0.95 
0.40 



12.0 
1.5 



25 



30 




Prom the above results, it is evident that 
effective regulation with phosphate is obtained, with 
one orientation being superior to the other. 

It is evident from the above results, that 
highly efficient expression can be obtained, either 
constitutive or regulated, by providing for truncated 
promoter regions of yeast glycolytic enzyme gene promo- 
ters, employing the 3- domain proximal to the coding 
region of the gene in conjunction with a 5 '-portion or 
second domain of the promoter region of a yeast gene 
subject to inducible regulation by a nutrxent, e.g., 
carbon source or phosphate, temperature, or other exter- 
nally controllable source or condition. Alternatively, 
the second domain may be replaced by prokaryotxc se- 
quences of at least about lkb or greater, which provide 
for constitutive enhancement in the absence of the sec- 
ond domain of the promoter region. Thus, a wide variety 
of genes exogenous to yeast may be expressed m high 
yield in high percentages of the total protein of the 
yeast host. 
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Although the foregoing invention has been 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it 
will be obvious that certain changes and modifications 
may be practiced within the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. in a method for obtaining expression of 
a foreign structural gene in a yeast host employing 
hybrid constructions involving transcriptional and 
translational regulatory signals recognized by said 
host joined to a foreign gene for transcriptional and 
translational regulation of expression in said host, 
wherein said construction is introduced into said host 
and said foreign gene is expressed to produce the en- 

coded polypeptide, 

the improvement which comprises: 
employing for transcriptional regulation, a 
DNA sequence having first and second domains, 

said first domain being proximal to said struc 
tural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding site 
and transcription initiation site of a yeast glycolytic 

enzyme gene; and 

said second domain distal to said structural 

gene providing for enhanced efficiency of transcription 

and being the regulatory region of a regulated yeast 

gene other than the natural gene of said first domain 

or a prokaryotic sequence of at least about SOObp. 

2. A method according to Claim 1, wherein 
said yeast glycolytic enzyme gene of said first domain 
is glyceraldehyde-3-phosphate dehydrogenase. 

3. A method according to Claim 2, wherein 
said regulatory region is the GAL1 or GAL10 regulatory 
region. 

3 4. A method according to Claim 2, wherein 

said regulatory region is the PH05 regulatory region. 



0164556 

43 

5. A method according to Claim 1, wherein 
said yeast glycolytic enzyme of said first domain is 
pyruvate kinase. 

6. A method according to any of Claims 1, 

5 2, 3, 4, or 5, wherein said polypeptide is Hepatitis B 
surface antigen, superoxide dismutase, or a ^antitrypsin. 

7. In a method for obtaining expression of 
a foreign structural gene in a yeast host employing a 
hybrid construction involving transcriptional and trans- 

10 lational regulatory signals recognized by said host 
* joined to a foreign gene for transcriptional and trans - 
lational regulation of expression in said host, wherein 
said construction is introduced into said host and said 
foreign gene is expressed to produce the encoded poly- 

15 peptide, the improvement which comprises employing for 
transcriptional regulation, a DMA sequence having first 
and second domains, 

said first domain being proximal to said struc- 
tural gene, being of about 200-500bp, being derived 

20 from the 5 1 -untranslated region of the yeast glyceralde- 
hyde- 3 -phosphate dehydrogenase or pyruvate kinase gene 
and extending downstream to at least nucleotide -10; 
and 

said second domain distal to said structural 
25 gene, being a prokaryotic sequence of at least about 
lOOObp, wherein said yeast cells containing said con- 
struct are grown to high density; and 

isolating said polypeptide. 



8. In a method for obtaining expression of 
30 a foreign structural gene in a yeast host employing 

hybrid constructions involving transcriptional and trans- 
lational regulatory signals recognized by said host 
joined to a foreign gene for transcriptional and trans - 
lational regulation of expression in said host, wherein 
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said construction is introduced into said host and said 
foreign gene is expressed to produce the encoded poly- 
peptide, the improvement which comprises employing for 
transcriptional regulation, a DNA sequence having first 

and second domains, 

said first domain being proximal to said 
structural gene, being of about 200-500bp and including 
the RNA polymerase binding site and transcription ini- 
tiation site of the yeast glyceraldehyde-3 -phosphate 
dehydrogenase or pyruvate kinase gene; and 

said second domain distal to said structural 
gene having a sequence providing for regulation of the 
acid phosphatase PH05 gene or the GAL1 or GAL10 gene, 

growing said host containing said construct 
in a nutrient medium to high density of said host under 
non- inducing conditions; 

changing the conditions to inducing conditions 
to induce expression of said structural gene; and 

isolating the polypeptide. 

9. A DNA construct comprising in the down- 
stream direction a yeast promoter region having first 

m 

and second domains, an insertion sequence of from about 
4-100bp having at least one unique restriction site for 
the construct, and a termination region, wherein said 
two domains are a distal domain of at least about lOObp 
of a DNA sequence providing for regulation of a yeast 
gene or a prokaryotic sequence of at least about 500bp 
and a proximal domain which includes the RNA polymerase 
binding site and the transcription initiation site of a 
yeast glycolytic enzyme gene. 

10. A DNA construct for expression of a 
structural gene foreign to yeast, comprising in the 
downstream direction: 
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a promoter region comprising two domains and 
providing for transcriptional and translational signals 
for initiation in yeast; 

a structural gene under regulatory control of 
said promoter region; and 

a termination region providing for transcrip- 
tional and translational termination in yeast; 

wherein the distal domain comprises a sequence 
of at least about lOObp including a sequence for regula- 
tion of a regulatable yeast gene or a prokaryotic se- 
quence of at least about 500bp and the proximal domain 
includes an RNA polymerase binding site and transcrip- 
tion initiation site of a yeast glycolytic enzyme gene. 

11. A construct according to Claim 10, where- 
15 in said gene is a ^antitrypsin, superoxidase dismutase ' 

or Hepatitis B surface antigen. 

12. A modified yeast host containing as an 
extrachromosomal element a plasmid vector having a rep- 
lication system, at least one marker, and a cassette 

20 comprising in the downstream direction of transcription, 
a promoter region functional in yeast for transcriptional 
and translational initiation, a structural gene foreign 
to said yeast under the transcriptional and translational 
control of said promoter region, and a terminator region 

25 functional in said yeast, wherein said promoter region 
has two domains, a distal domain distant from said gene 
of at least about lOObp comprising a sequence providing 
for regulation of a regulated yeast gene or a prokaryo- 
tic sequence of at least about SOObp, and a proximal 

30 domain proximal to said gene, comprising the RNA poly- 
merase binding site and transcription initiation site 
of a yeast glycolytic enzyme gene and of about 200 to 
600bp and extending to at least the -20nt of said yeast 
glycolytic enzyme gene. 
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13. A yeast cell according to Claim 12, where- 
in the yeast glycolytic enzyme gene for said proximal 
domain is glyceraldehyde-3 -phosphate dehydrogenase or 
pyruvate kinase. 

14. A method for producing a foreign protein 

which comprises: 

growing cells according to Claim 12 , wherein 

said foreign protein is produced. 

15. A method according to Claim 14, wherein 
said foreign protein is (^-antitrypsin, human superoxide 
dismutase or Hepatitis B surface antigen. 

16. A method according to Claim 2, wherein 
said regulatory region is the ADH1I regulatory region. 
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Description 

Field of the Invention 

5 With the advent of hybrid DNA technology, production of mammalian proteins in microorganisms 
became a reality. For the first time, mammalian proteins could be synthesized in a unicellular microorgan- 
ism by introduction of a gene encoding for the mammalian protein under the transcriptional and translational 
control of regulatory sequences recognized by the microorganism host. The introduction of these foreign 
constructions into a microorganism host resulted in competition between the regulatory signals of the 

ro construct and the regulatory signal endogenous to the host for the host systems involved with expression. 
The structural gene of interest is usually directed to a product which is nonproductive and may be injurious 
to the host. Therefore, host cells which can turn off the foreign gene can effectively dominate modified host 
cells. 

Substantial progress has been made in Isolating sequences concerned with transcriptional and tran- 
is slational regulation for protein expression. But frequently flanking sequences, as well as distant sequences, 
may also affect the efficiency and regulation of the expression of the protein. Therefore, as one manipulates 
these various sequences, removing them from their native environment, and joining them to unnatural 
sequences, that is sequences other than the wild-type sequence, one can never be certain as to the result. 
In order to enhance the economies of producing proteins in microorganisms, there have been 
20 substantial efforts directed to improving the efficiency of transcription and translation, maximizing the 
proportion of total protein directed to production of the desired product, enhancing the viability of the 
modified host, as well as improving the efficiency with which the modified host may be obtained. 

Description of the Prior Art 

25 

Guarente et al., Proc. Natl. Acad. Sci. USA (1982) 79:7410-7414, describe a hybrid promoter region 
employing the GAL4 regulatory region. Guarente and Ptashne. ibid. (1981) 782199-2203, report the 
presence of two domains in a yeast promoter, with a region upstream from the TATA sequence providing 
an activation site. Kramer et a}., ibid. (1984) 81:367-370, describe the regulated expression of a human 
30 interferon gene in yeast employing the yeast acid phosphatase promoter, where expression is induced by 
phosphate or a temperature shift. Tekamp-Olson et al.. Cold Spring Harbor Meeting, Molecular Biology of 
Yeast, 1983, describe the absence of deleterious effects on yeast viability when employing "short" 
promoters, as distinct from the presence of such effects when employing an extended promoter region or 
"long" promoters. 

35 EP-A 72 318 concerns the expression of the Hepatitis B surface antigen in yeast using the alcohol 
dehydrogenase promoter, devoid of its upstream regulatory region. Nature, Vol. 300 (1982), p. 724 - 728, 
discloses the promoter from one yeast gene used with a different yeast structural gene but does not 
disclose a hybrid activation site construct. EP-A 73657 discusses the possibility of constructing yeast hybrid 
promoters consisting of the regulator of one gene joined to the promoter of a second, highly expressed, 

40 gene in order to control expression. 

Summary of the invention 

The invention is directed to a method for obtaining expression of a foreign structural gene in a yeast 
45 host employing hybrid constructions involving transcriptional and translational regulatory signals recognized 
by said host joined to a foreign gene for transcriptional and translational regulation of expression in said 
host, wherein said construction is introduced into said host and said foreign gene is expressed to produce 
the encoded polypeptide, comprising: 

employing for transcriptional regulation, a DNA sequence having first and second domains, 
so said first domain being proximal to said structural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding site and transcription initiation site of a yeast 
glyceraldehyde-3-phosphate dehydrogenase gene; 

said second domain being proximal to said first domain and distal to said structural gene, providing for 
enhanced efficiency of transcription and being the regulatory region of a regulated yeast gene other than 
55 the natural gene of said first domain; and 

said regulatory region being the PH05 regulatory region. 

According to a preferred embodiment of the invention, said polypeptide is Hepatitis B surface antigen, 
superoxide dismutase or a 1 -antitrypsin, and said regulatory region is the ADHD regulatory region. 

3 
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According to a further embodiment, the invention is directed to a method for obtaining expression of a 
foreign polypeptide in a yeast host employing a hybrid construction involving transcriptional and tran- 
slational regulatory signals recognized by said host joined to a foreign gene for transcriptional and 
translation^ regulation of expression in said host, wherein said construction is introduced into said host and 
5 said foreign gene is expressed to produce the encoded polypeptide, comprising employing for transcrip- 
tional regulation, a DNA sequence having first and second domains, 

said first domain being proximal to said structural gene, being of about 200-500bp, being derived from the 
S'-untranslated region of the yeast glyceraldehyde-3-phosphate dehydrogenase or pyruvate kinase gene 
and extending downstream to at least nucleotide -10; and 
ro said second domain being distal to said first domain and proximal to said structural gene, being a 
prokaryotic sequence of at least about 1000bp, wherein said yeast cells containing said construct are grown 
to high density; and 
isolating said polypeptide. 

According to a further preferred embodiment, the invention is concerned with method for preparing a 
15 polypeptide in a yeast host, where the polypeptide is heterologous to the yeast and may be produced in 
low percentage amounts of total protein, the improvement which comprises: 

joining an open reading frame DNA sequence coding for said polypeptide with a second open reading 
frame DNA sequence coding for heterologous human superoxide dismutase to form a fusion polypeptide; 
introducing the sequence coding for said fusion polypeptide under conditions for expression in said yeast 
20 which includes an inducible transcriptional initiation regulatory region that consists essentially of, reading in 
a 5* to 3* direction, a yeast glycolytic enzyme promoter region and adjacent alcohol dehydrogenase-2 
control region, whereby said fusion polypeptide is expressed; 
isolating said fusion polypeptide to provide the polypeptide in high yield. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagrammatic view of plasmids GAP1-6; 
Figure 2 is a diagrammatic view of plasmids Pyk1-6; and 

Figure 3 indicates the DNA linker sequence and a flow diagram showing its use in a construct for hSOD. 

30 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Novel methods are provided employing fragments for enhanced expression of structural genes in a 
yeast host. Constructs employing a hybrid promoter region provide for enhanced efficiencies of transforma- 
35 tion and greatly improved viability of the yeast host as contrasted with those employing a wild-type yeast 
promoter. Concomitant with the improved viability is increased expression of a structural gene, in 
comparison with the truncated promoter region, and, therefore, greatly enhanced overall yields of expres- 
sion products. 

For the purposes of the subject invention, the "promoter region" is divided into two domains: (1) the 

40 structural gene proximal region, which includes the transcription initiation site, the "TATA" sequence 
capping sequence, as appropriate, and an RNA polymerase binding sequence, which sequence intends a 
sequence which includes nucleotides upstream from the initiation site for directing the initiation of synthesis 
of the messenger RNA; and (2) a distal region which provides for regulated or constitutive expression, with 
enhanced expression as compared to the first domain linked to non-functional yeast DNA. 

45 The hybrid promoters of the subject invention employ the RNA polymerase binding region of a yeast 
glycolytic enzyme promoter and a region upstream from said polymerase binding region, which is different 
from the wild-type upstream region of the RNA polymerase binding region and provides for enhanced 
efficiencies of transcription. This distal region will be derived from either a sequence, usually a yeast 
sequence, involved in regulation of transcription, or a prokaryotic sequence which provides for enhanced 

50 constitutive expression of the desired gene. 

Conveniently, cassettes or constructs can be prepared which provide for one or more restriction sites 
intermediate the promoter region and a related terminator region where the structural gene may be inserted, 
so as to be under the transcriptional control of the hybrid promoter region. By having one or more 
restriction sites, one can provide for ease of insertion of the structural gene intermediate the transcription 

55 initiation and termination regions. The cassettes which can be prepared comprising the transcriptional 
initiation and termination region, having appropriate restriction sites for structural gene insertion can be 
cloned in prokaryotic vectors, so that after insertion of the structural gene, the resulting cassette, including 
the structural gene, may be cloned, isolated and purified, before introduction into a yeast vector. 
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The cassette, will for the most part, have the following formula: 
-(P.R.(2)-P.R.<1 ))-R.S.-T.R.- 
s wherein: 

P.R.(1) is the promoter region proximal to the structural gene and having the transcription initiation site, 
the RNA polymerase binding site, and including the TATA box. the CAAT sequence, as well as translation^ 
regulatory signals, e.g., capping sequence, as appropriate; 

P.R.(2) is the promoter region joined to the 5'-end of P.R.(1) associated with enhancing the efficiency of 
to transcription of the RNA polymerase binding region; 

R.S. is a sequence having one or more restriction recognition sites, preferably at least two restriction 
recognition sites, where the sites may result upon restriction into blunt ends or overhangs; 

T.R. intends the termination region, which will include the terminator, which may be a stem and loop 
structure, and which may be associated with one or more stop codons. a polyadenylation signal sequence. 
is if any, as well as any other transcriptional and translationa) termination sequences. 

P.R.(1) will generally be at least about 150bp, more usually at least about 200bp, usually not more than 
about 600bp, more usually not more than about SOObp, generally not more than about 450bp and preferably 
less than about 400bp; the sequence will extend in the downstream direction of transcription to about bp 
+3, more usually bp -1 and may extend only to bp -20, more usually to bp -10 (the numbering intends that 
20 +1 is the first bp of the initiation codon with which the promoter region is associated in the wild-type host, 
while -1 is the immediately upstream bp and the integers increase in the direction of transcription; 

P.R.(1) will be derived from a strong yeast promoter, normally a glycolytic enzyme promoter, such as 
glyceraldehyde-3-phosphate dehydrogenase, pyruvate kinase, alcohol dehydrogenase, 
phosphoglucoisomerase. triose phosphate isomerase, phosphofructokinase, etc.; 
25 P.R.(2) will provide for an enhancing function of transcription, which enhancing function may provide for 
constitutive or regulated transcription; regulators will be derived from regions associated with regulation of 
yeast genes, other than the natural or wild-type gene associated with the first domain in the wild-type or 
natural host, such as acid phosphatase (PH05) or alcohol dehydrogenase II (ADM I and II), etc. For yeast 
regulatory sequences, the domain will usually be at least about 100bp, more usually at least about 200bp, 
30 for convenience generally not exceeding about 3kbp, usually not exceeding about 1kbp, desirably not 
exceeding about SOObp. The regulatory region will generally begin at least about 200bp from the initiation 
codon, usually at least about 300bp and may begin at 400bp or farther upstream from the initiation codon. 

Regulation can be as a result of a change in the chemical or physical environment of the host, such as 
a change in carbon source, e.g., glucose to galactose or vice versa; a change in concentration of a nutrient, 
35 e.g., an inorganic nutrient such as a phosphate; or a change in temperature, e.g., 25 # C to 35 *C. 
Constitutive transcription can be achieved employing prokaryotic sequences of at least about 1kbp for 
convenience, generally not exceeding about 5kbp; conveniently, the prokaryotic sequence can be obtained 
from the vector in which the cassette is cloned, illustrative vectors including pBR322. lambda. Charon 4A, 
pACYC184, pUC5, etc. 

40 R.S. will generally be at least 4bp, more usually at least 6bp. and may be lOObp or more, more usually 
being not more than about 60bp and may include one or more, usually not more than about 10 restriction 
sites, where such restriction sites may be illustrated by EcoRI, Bam HI, Sai l, Hindlll, Alul, Aval, Taq l, Hpa l, 
etc., having at least one unique restriction site for the construct sequences. 

T.R. is the termination region which will include the necessary transcriptional and translational signals 
45 for termination, such as the polyadenylation site, etc.; 

T.R. will generally be at least about 100bp. more usually at least 150bp. and generally less than about 
ikbp. usually less than about 600kbp; the termination region may be derived from any convenient yeast 
sequence, so long as the terminator balances the promoter, conveniently being derived from a glycolytic 
enzyme terminator, where the terminator may be associated with the same or different enzyme with which 
so the promoter is associated. 

Where a cassette is cloned in a bacterial vector, the construction will have the following formula: 



-(P.R.(2)-P.R-(l))-R.S.-T.R.-Rep(B)-(M(B)) - 

I <-)- 

b 
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wherein all the symbols have been defined previously, except for 

Rep (B), which intends a replicon or replication system recognized by a prokaryotic host and may be 
derived from a plasmid or phage, such as ColE1, and R plasmid, e.g., pRK290, lambda, e.g., Charon 4A, 
Xdv, etc.; 

5 M is a marker which provides for selection of hosts containing the construction, where (B) intends a 
prokaryotic, e.g., bacterial, host and a intends an integer of from 0 to 3, usually 1 to 2, although additional 
markers may be present, where the marker allows for selection of the host containing the construct as well 
as providing for selective pressure on maintaining hosts having the construct; the markers include biocide 
resistance, such as antibiotic resistance, toxin resistance and heavy metal resistance; providing prototrophy 
w to an auxotrophic host; providing immunity; and the like; 

the markers may provide for complementation of an auxotrophic host, e.g., his", ura", trp~,leu~ 
genotype, resulting in prototrophy; resistance to metals, such as cup + genotype; resistance to antibiotics, 
such as amp r , tc r , cam r , str*. tui* genotype, etc.; 

b is 0 or 1 , intending that the construction is either linear or circular, usually circular. 
75 The above construct can be used for insertion of a wide variety of structural genes, both prokaryotic 
and eukaryotic, both naturally occurring and synthetic, where the genes may include signal leaders for 
secretion, and the like. The genes may express enzymes, hormones, proteins from pathogens for vaccines, 
structural proteins, lymphokines, membrane surface proteins, immunoglobulins, blood proteins, or the like. 
The particular structural gene which is inserted is not critical to this invention and any polypeptide or protein 
20 of interest may be prepared employing the constructions of the subject invention. The structural genes will 
usually be foreign to the yeast host, where foreign intends different from wild-type yeast structural genes 
and from a source that does not normally exchange genetic information with yeast 

Usually, the structural gene will be at least about 36bp, and not more than about 20kbp, usually not 
more than about 3000bp, usually not more than about 1500bp. Included in the structural gene may be non- 
25 coding flanking regions, the S'-fianking region normally being quite short, usually less than about 30bp, 
while the 3*-flanking region may be extended, usually not exceeding about 500bp. Thus, the structural gene 
fragment will usually include the translational stop codons for proper termination of amino acid chain 
extension. 

When the structural gene has been inserted into the cassette which is joined to a yeast replication 
30 system, normally including one or more markers recognized by yeast, the resulting construct will have the 
following formula: 



-<P.R. <2)-P.R. (1) )-gene-T.R.-Rep(Y)-(M(Y)) - 
35 I 

wherein ail of the symbols have been defined previously except for: 

gene, which intends the structural gene, having its initiation codon and stop codons as appropriate; and 

40 Y, which intends that the symbol is related to yeast. 

Convenient yeast replication systems include the 2um plasmid replication system, combination of 
CEN3 and ARS1 or ARS3, or the like. The replication systems may be high or low copy number, depending 
on the effect of the construct on the viability of the host. While the indicated replication systems are those 
which have found common employment, any replication system useful in yeast may be employed which 

45 provides for efficient replication and maintenance. Often the structural gene will be inserted into an 
appropriate shuttle vector capable of replication and selection in either a yeast or bacterial host, where the 
resulting construction will have the following formula: 

5 0 -(P.R.(2)-P-R.(l))-gene-T.R.-Rep(Y)-(M(Y)) -Rep(B)-(M(B)) - 
I * * | 



where all symbols have been defined previously. Also, it is, of course, understood that the cassette without 
55 an inserted structural gene but containing the restriction enzyme recognition sequence, R.S., may be 
propagated in yeast or contained within a shuttle vector, where the construction will have the following 
respective formulae: 
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-(P.R. (2)-P.R.(l))-R.S.-T.R.-Rep(Y)-(M(Y)) - 
I L_l 

5 

-(P.R.(2)-P.R.(1))- R.S.-T.R.-Rep(Y)-(M(Y)) 4 -Rep(B)-(H(B)) a - 



10 

where all symbols have been defined previously. 

The various fragments which form the cassette and final constructions may be joined together in 
accordance with conventional ways. In many cases, genes have been isolated and restriction mapped, as 

rs well as sequenced. To that extent, one can select the sequence of interest by restriction of the gene, 
employing further manipulation as necessary such as resection with Bal31, in vitro mutagenesis, primer 
repair, or the like, to provide a fragment of a desired size, including the desired sequence, and having the 
appropriate termini. Linkers and adapters can be used for joining sequences, as well as replacing lost 
sequences, where the restriction site is internal to the region of interest. The various fragments which are 

20 isolated, may be purified by electrophoresis, electroe luted, ligated to other sequences, cloned, reisolated 
and further manipulated. 

The use of regulatory sequences for controlling transcription of the structural gene of interest allows for 
growing the host cells to high density with no or low levels of expression of the structural gene, and then 
inducing expression by changing the environmental conditions, e.g., nutrient, temperature, etc. 

25 For example, for PH05 regulation one could grow the cells at high phosphate, about 1 to 10mM, and 
then decrease the phosphate concentration to about 0.1 to 0.5mM. For temperature sensitivity, one could 
grow the cells at 25* to 37*C and then change the temperature as appropriate by about 5* to 20 *C. The 
host cells would have the regulatory system associated with the regulatory region employed. 

Various techniques will be exemplified in the Experimental section of this application, which techniques 

30 can be used as paradigmatic for constructions employing fragments from sources other than those 
exemplified. Of particular interest, as evidenced by the Experimental section, will be the use of the 
glyceraldehyde-3-phosphate dehydrogenase promoter region for the RNA polymerase binding site in 
conjunction with regulator sequences, such as those associated with PH05 or ADHII. The PH05 and ADHII 
sequences refer to regions associated with the PH05 and ADHII genes which provide for transcriptional 

35 regulation of the PH05 and ADHII gene, respectively. 

The following examples are offered by way of illustration and not by way of limitation. 

EXPERIMENTAL 

40 All DNA manipulations were done according to standard procedures. See Molecular Cloning , T. Maniatis 
et al., Cold Spring Harbor Lab., 1982. Enzymes used in cloning were utilized as per the manufacturer's 
specifications. Enzymes were obtained either from New England Biolabs or Bethesda Research Laborato- 
ries. Procedures with these enzymes employed the supplier's directions. Yeast were transformed and grown 
using a variety of media including selective medium (yeast nitrogen base without leucine); YEPD medium, 

45 containing 1% (w/v) yeast extract, 2% (w/v) peptone and 2% (w/v) glucose, and others as appropriate 
and/or detailed below. In the case of plating medium contained 2% (w/v) agar and for transformation 3% top 
agar. Hepatitis B surface antigen was determined after lysis of yeast by glass bead agitation and 
clarification by centrifugation, using the Ausriall assay (Abbott Laboratories). Protein is determined by the 
Coo mas si e dye binding method. 

50 

Construction of GAL regulator containing plasmids . 

Plasmid pLGSDS is prepared as described in Guarente et al., (1982) supra . The plasm id was 
manipulated as follows: After restriction with Xho l, the overhangs were filled in with the Klenow fragment of 
55 DNA polymerase I ("Klenow fragment"), ligated with EcoRI linkers (GGAATTCC) and then completely 
digested with EcoRI and Sau 3A to provide a 370bp fragment which was isolated by gel electrophoresis and 
included the intergenic sequence between GAL1 and GAL10 genes of yeast, and provides for the GAL4 
regulation sequence of the GAL1 and GAL10 genes. 
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This fragment was inserted into pBR322 which had been completely digested with Eco RI and BamHI, 
followed by treatment with alkaline phosphatase to prevent ofigomerizatton. The resulting plasmid pBRGAL4 
was treated in two different ways. 

In the first procedure pBRGAL4 was completely digested with Sau3A, the overhangs filled in with the 
5 Klenow fragment, and the resulting blunt-ended fragment ligated with Sail linkers (CGTCGACG), followed by 
digestion with Sail and Xhol. The resulting 370bp fragment was isolated by gel electrophoresis. This 
fragment has the original 370bp yeast GAL4 regulator sequence with Xhol and Sail termini. 

The second fragment was obtained by complete digestion of pBRGAL4 with Xhol and Sail to provide a 
Xhol-Sajl fragment which included the 370bp yeast GAL4 regulator sequence as well as about 280bp of 
10 pBR322, the GAL4 sequence extending from Sau3A to Sai l. 

The two fragments were then cloned in the plasmid plots, plots was prepared by inserting the 40bp 
polylinker fragment of the following sequence 



,s EcoRI BamHI Boll I Xbal Hindi 1 1 

~ r i i — r — r 

5 • AATTCCCGGGGATCCGTCGACCTGCAGATCTCTAGAAGCTTCAG 
3 1 GGGCCCCTAGGCAGCTGGACGTCTAGAGATCTTCGAAGTC 

III I 
20 Smal Sail PstI PvuII 



into pBR322 as an EcoRI-Pvull substitution followed by insertion of the trp-lac promoter (Russell and 
Bennett. Gene (1982) 20:231-245) into the Pyull site with transcription oriented toward the polylinker 

25 sequence, plots was completely digested with Sail, followed by treatment with alkaline phosphatase and the 
370bp and 650bp fragments independently inserted into plots to provide plasmids plot5GAL4/370 and 
plot5GAL4/650, respectively. Each of the plasmids was then completely digested with Bam HI and Sail to 
reproduce the individual fragments extended by 6bp of the polylinker fragment. These fragments were then 
ligated into pC1/1, which had been completely digested with Bam HI and Sail followed by treatment with 

30 alkaline phosphatase to prevent recircularization. Plasmid pC1/1 is a derivative of pJDB219 (Beggs, Nature - 
(1978) 275 : 104 ) in which the region corresponding to bacterial plasmid pMB9 in pJDB219 has been 
replaced by pBR322 in pC1/1. The resulting plasmids were designated pC1/1GAL4/370 and 
~*pC1/lGAL4/650. respectively. The Bam HI-Sall fragment is located in the pBR322 portion of the vector 
pC1/1. 

35 The next construction develops a hybrid promoter for expression of the Hepatitis B surface antigen 
(HBsAg or sAg), employing the RNA polymerase binding region of GAPDH. The previously prepared 
plasmid pHBS56/16-3, a yeast shuttle vector containing the alcohol dehydrogenase 1 (ADH1) promoter, the 
HBsAg gene and ADH terminator as a Sghl fragment, was digested with Sphl and the ends modified with 
Bam linkers. The Bam linkers have the sequence CGGATCCG. 

40 pHBS56/16-3 was prepared as follows: A Taql-Hpal fragment obtained from the HBsAg coding region 
which included 26bp of the pre-sAg region, 681 bp of the sAg region and 128bp of the 3'-untranslated 
region, was linked with Eo)RI linkers and cloned at the EcoRI site in pBR322. The EcoRI linkers have the 
sequence GGAATTCC. The plasmid pHBSS was thus obtained. 

After digesting pHBSS with EcoRI, the digest was resected with Bal31 and religated with EcoRI linkers 

45 (GGAATTCC). After digestion with EcoRI the material of about 800bp was isolated from a polyacrylamide 
gel. This isolate was then recloned into pBR322 which had been digested with EcoRI and treated with 
alkaline phosphatase. Where the resection was to the sequence CATGG. which included the methionine 
codon, the EcoRI linker created an Ncol site. The plasmids were screened for the presence of an Ncol site 
and one of the plasmids chosen for further manipulation. This plasmid, designated pHBS5-3, was restricted 

so with EcoRI, the EcoRI fragment made blunt-ended with Klenow fragment and dNTPs, and the blunt-ended 
fragment was then restricted with Xbal to provide an about lOObp fragment having an Xbal overhang and 
blunt end at the former EcoRI site. 

pHBSS was then digested with Clal, made blunt-ended with the Klenow fragment and dNTPs, digested 
with Xba l, followed by alkaline phosphatase treatment. The 100bp fragment was then inserted into the 

55 vector to provide the plasmid pHBS6. Upon sequencing of the blunt-ended ligation site, it was found that an 
adenosine had been lost, so as to lose the EcoRI site, where the sequence was now ATCGATTCCCATGG. 
The Clal and Ncol sites were retained. The loss of the A resulted in pHBS6 having a single EcoRI site. 
pHBS5-3 was digested with EcoRI and the resulting EcoRI fragment having the sAg fragment isolated 
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by gel electrophoresis and inserted into the vector pHBS16 (Valenzuela et al., Nature (1982) 298:347-350). 
This plasmid has the ADH1 promoter and the sAg gene in an Ew)RI fragment in a plasmid containing the 
2um origin, a Trpl gene and pBR322. The plasmid was digested with EcoRI, treated with alkaline 
phosphatase to prevent recircularization. and the EcoRI fragment from pHBS5-3 inserted to provide 

5 pHBS16-3, where the sAg gene isolated as a Taql-Hpal fragment had been modified by Ba!31 resection. 
The plasmid pHBSl6-3 was digested with Sph l and Xbal to provide a fragment which had the ADH 
promoter at the Sgh terminus and the 5'-end of the sAg gene. 

pHBS56 was then digested with Sgh I. pHBS56 was prepared from pC1/1 by digestion with Sghl, which 
deletes a portion of the plasmid spanning the 2um-pBR322 joint. The active portion of the ADH1 promoter 

to region is contained within the Sphl-Hindlll fragment of approximately 300bp (Bennetzen et ah, J. Biol. 
Chem. (1982) 257:301). The Sphl site in the ADH promoter begins at position -413 and the yeast terminator 
sequence is contained within a Hindlll-Sphl fragment of about 330bp. In each case the Sphl site is distal to 
the coding region. A 1500bp ADH1 promoter fragment terminating at position -9 (Hitzeman et al., Nature - 
(1981) 293:717) and an approximately 450bp terminator unit from nucleotides 913 to 1368 in the ADH gene 

15 nucleotide sequence were joined at a Hindlll site between the fragments and cloned into the Bam HI site of 
the vector YEp13 (Broach and Hicks, Gene (1979) 8:121) to provide pADH5. 

The HBsAg-DNA segment of pHBS5 was excised by EcoRI digestion, blunt-ended with the Klenow 
fragment and joined at both ends with Hindlll linkers, CAAGCTTG. After digestion with Hindlll, the HBsAg 
fragment was inserted into the Hindlll site of the plasmid pADH5 which had been digested at the Hindlll site 

20 intermediate the ADH1 promoter and terminator sequence. A plasmid with the HBsAg gene in the correct 
orientation as determined by restriction analysis was designated pHBS22. The cassette was included 
between two Sphl restriction sites. pHBS22 was digested with Sph l to obtain a fragment of about 1500bp 
and inserted into Sph l digested pC1/1 to provide pHBS56 which was cloned in E. colt HB101 . 

pHBS56 was digested with Sphl and Xba l to provide a 1.1 kb fragment having the ADH terminator 

25 region and the 3'-portion of the sAg gene with the Sph l site proximal to the terminator region. The 1.1 kb 
Sph l-Xbal fragment was joined to the Sph l -Xba l fragment from pHBSl6-3, which resulted in providing the 
complete sAg gene in the correct orientation between the ADH promoter and terminator. This Sph l-Sphl 
fragment was then ligated to Sph l digested pHBS56, replacing the cassette of pHBS56 to provide the 
plasmid pHBS56/l6-3 with the resected sAg coding region fragment. The cassette was then excised from 

30 pHBS56/16-3 by digestion with Sph l, followed by chewing back the overhangs with the Klenow fragment in 
the presence of dNTPs, then ligated with Bam HI linkers, followed by digestion with Bam HI to provide a 
1 .6kb fragment which was isolated by gel electrophoresis. The fragment included the ADH promoter region, 
the sAg gene and ADH terminator region, as described above. This fragment was inserted into the Bam HI 
site of pBR322 to provide pPGT16-3 which was digested with Bam HI and Xbal and the resulting 1.1 kb 

35 fragment gel isolated, where the Xbal- Bam HI fragment had the 3' portion of the sAg gene and the ADH 
terminator region. 

pHBS6 was digested with Xba l and Ncol and the 94bp fragment gel isolated to provide the 5'-portion of 
the sAg gene. A synthetic adapter was prepared of the formula 

40 CGA 2 TA 3 (CA) 3 TA 3 CA 3 CAA 

T 2 ATg ( GT ) 3 AT 3 GT 3 GTTGTAC 

45 having Tagl and Ncol termini and providing the -25 to -1 nucleotides of the GAPDH (GAP49) promoter and 
the initiation codon of the sAg gene. This synthetic fragment, the Ncol-Xbal fragment, and the Xbal-BamHI 
fragment were ligated simultaneously, followed by digestion with Tagl and Bam HI. The resulting fragment 
was then substituted into pBR322 linearized with Clal and Bam HI, followed by treatment with alkaline 
phosphatase. The resulting ptasmid, which contains the -1 to -25bp of the GAPDH promoter region, the sAg 

so gene, and the ADH terminator, where the Nco l restriction site is tost was called pHBS6LGAPsAgtADH. 

pGAP1, a plasmid prepared by insertion of a Hindlll fragment containing the GAPDH gene GAP49 
(Holland and Holland, J. Biol. Chem, (1979) 254:5466-5474) inserted in the Hindlll site of pBR322, was 
digested with Hinfl and a 500bp promoter containing fragment isolated. The fragment was resected with 
Bal31 to remove about 50 or 90bp, followed by ligation with Hindlll linkers and digestion with Hindlll. 

55 pBR322 was digested with Hindlll, followed by treatment with alkaline phosphatase and the about 450 or 
410bp fragment inserted to provide pGAP128 and pGAP396, respectively. 

pGAP128 was digested with Hindlll, the fragment made blunt-ended with the Klenow fragment and 
dNTPs and the resulting 450bp fragment isolated by gel electrophoresis. This fragment was inserted into 
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Sma l digested plots, which had been treated with alkaline phosphatase, to provide plasmid plot5pGAP128, 
which contained about -400 to +27bp of the GAPOH promoter and coding region. Plasmid plot5pGAP396 
was prepared from pGAP396 in an identical manner and thus differs from plasmid plot5pGAP128 in having 
about 15-30 fewer bp at each terminus of the GAPDH promoter region (about -385 to -3). 

5 Plasmids GAP1-GAP4 were then prepared in the following manner. Plasmid plot5pGAP128 was 
digested with Tagl and Bam HI to provide an about 390bp Taql -Bam HI fragment which included the -26 to 
about -400bp of the GAPOH promoter region and a portion of the Hindlll and plots polylinker. 
pH BS6LG APsAgtADH plasmid was also digested with Tagl and Bam HI and a 1.1 kb Taql- Bam HI fragment 
containing the 3'-terminus of the GAPOH promoter region, the sAg gene and the ADH terminator region was 

io gel isolated and ligated to the other Taql -Bam HI fragment to provide a Bam HI -Bam HI fragment which 
included approximately 400bp of the GAPDH promoter region, the sAg gene in proper orientation for 
transcriptional regulation by the GAPDH promoter, followed by the ADH terminator region. This fragment 
was ligated into pBR322 which had been digested with Bam HI and treated with alkaline phosphatase to 
provide plasmid pPGTSO. This Bam HI cassette could now be isolated and inserted into plasmid pCl/1, at 

75 the Bam HI site in the pBR322 portion of pC1/1, where in plasmid GAP1 the ADH terminator region is 
proximal to the amp f gene with the pBR322 portion divided into an approximately 4kb sequence including 
the amp f gene and a 375bp region separating the cassette from the 2am sequences. In GAP2, the 
promoter is adjacent to the long pBR322 sequence with transcription in the same direction as the amp r 
gene. The same cassette was inserted into BamHI-digested pC1/1GAL4/650 to obtain plasmids GAP3 and 

20 GAP4, where GAP3 has the GAPDH promoter distal from the GAL4 regulator region and the long pBR322 
sequence and GAP4 has the GAPDH promoter adjacent to the GAL4 regulator region, which is adjacent to 
the long pBR322 sequence. 

Plasmids GAPS and GAP6 were isolated as follows. Plasmid plot5pGAP396 was digested with Sail and 
Tag l and a fragment containing 9bp of the plots polylinker sequence and the GAPDH promoter sequence 

25 extending from about -385 to -26bp was isolated. An approximately 130bp Taql-Xbal fragment including -25 
to -1bp of the GAPDH promoter and +1 to +93bp of the sAg gene was obtained from 
pHBS6LGAPsAgtADH. A 1.1kb Xbal-Sall fragment containing the 3'-portion of the sAg gene and the ADH 
terminator as well as 6bp of plots polylinker sequence was obtained from plasmid plotSsAgtADH (described 
below— Pyruvate Kinase Promoter). These three fragments were ligated, digested with Sail and then cloned 

30 into Sail-digested pC1/1GAL4tf70. GAPS has the GAPDH promoter region adjacent to the GAL4 regulator 
region, which is proximal to the short pBR322 sequence, and GAP6 has the GAPDH promoter region distal 
from the GAL4 regulator region and proximal to the long pBR322 sequence (see Fig. 1). 
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Pyruvate kinase promoter . 



Plasmid pHBS6Pyk containing the sAg gene under the transcriptional regulatory control of the Pyk 
promoter was obtained by cloning a 4.4kb insert of yeast genomic DNA in pBR322 containing the Pyk gene 
and 911 nucleotides of S'-untranslated region, and digestion of this plasmid pPyk9.1.1 with Xbal. After 
making the ends blunted-ended, the linear fragment was digested with Bam HI providing a 912bp BamHI- 

40 blunt fragment containing the Pyk promoter and 8 bases from the Pyk coding region. This fragment was 
inserted into the plasmid pHBS6, which had been digested with Nco l, blunt-ended and digested with 
BamHI. The plasmid pHBSBPyk was totally digested with EcoRI. to obtain a fragment including the sAg 
gene and a portion of the Pyk promoter region. The fragment was made blunt-ended with the Klenow 
fragment and dNTPs, followed by ligation to Bam HI linkers, digested with Xbal, which is internal to the sAg 

45 gene, the Xbal terminus made blunt-ended with the Klenow fragment and dNTPs, followed by digestion with 
Bam HI, to provide a 580bp BamHI-blunt-ended (Xbal) fragment. The plasmid plots was digested with 
EcoRI, made blunt-ended, digested with Bam HI and treated with alkaline phosphatase and the two 
fragments joined to provide plasmids plotSPyksAgSI and plotSPyksAg.57. The two differ in that the Bam HI 
site of the latter was not regenerated during cloning, possibly as a consequence of minimal nuclease 

so contamination (digestion). 

plots was treated as previously described (EcoRI digestion, blunt-ended, BamHI digestion and treat- 
ment with alkaline phosphatase) and joined to a 1.1 kb fragment obtained by digestion of pPGT16-3 with 
Xba l, followed by blunt ending, followed by digestion with Bam HI and gel isolation. This fragment was 
introduced into plots to provide the plasmid plotSsAgtADH. Again the BamHI site in this plasmid was not 
55 regenerated, presumably due to digestion by contaminating nuclease. 

Plasmids Pyk1 and Pyk2 were prepared as follows. Plasmid plotSPyksAgSI was digested with BamHI, 
then with Xbal, and an approximately 580bp fragment containing about 480bp of Pyk promoter and 93bp of 
the 5'-end of the sAg gene was gel isolated. A 1.1 kb Xbal-Sajl fragment containing the 3'-portion of the sAg 
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gene, the ADM terminator and about 6bp of the plots polylinker was isolated from plots AgtADH. These two 
fragments were ligated, digested with Bam HI and Sail and then cloned into plasmid pC1/1, which had been 
cleaved with Bam HI and Sail and treated with alkaline phosphatase, to yield plasmid Pyk1 . Plasmid Pyk2 
was prepared similarly but the 580bp Sall-Xbal. Pyk promoter/HBsAg gene S'-end fusion fragment was 

5 isolated from plot5PyksAg.57 and included about 6bp of plots polylinker sequence upstream from the 
promoter region. Also the 1.1 kb Xbal- Bam HI fragment containing the 3'-part of the HBsAg gene and the 
ADH terminator was derived from plasmid pPGT16*3. 

Plasmids Pyk3-Pyk6 were prepared as follows. Plasmid plotSPyksAgSI was digested with Bam HI, then 
with Xbal and the about 580bp fragment containing the Pyk promoter and the 5'-part of the HBsAg gene 

w isolated as above. The 1.1 kb BamHI-Xbal fragment, containing the 3'-portion of the HBsAg gene and ADH 
terminator, was recovered from pPGTl6-3, also as above, and the two fragments ligated, digested with 
Bam HI and inserted with different orientations into the Bam HI site of pC1/lGAL4/650 (Pyk3, Pyk4). 
Plasmids Pyk5 and Pyk6 were prepared similarly except that the Sall-Xbal fragment containing the Pyk 
promoter and 5'~end of the sAg gene was isolated from plotSPyksAg.57 and the Xba l-Sall sAg gene 3'- 

75 portion/ADH terminator fusion fragment was derived from plotSsAgtADH and thus both fragments included 
approximately 6bp of plots polylinker sequence. The cassette so formed was then cloned into the Sail site 
of pCl/lGAL4/370 in opposite orientations. 

The six plasmids designated Pyk1-6 (see Fig. 2) are distinguished by Pyk1 having the promoter region 
proximal to the short pBR322 sequence; Pyk2 having the promoter region proximal to the long pBR322 

20 sequence; Pyk3 having the promoter region proximal to the short pBR322 sequence and distal from the 
GAL4 sequence; while Pyk4 has the promoter region proximal to the GAL4 region, which in turn is proximal 
to the long pBR322 sequence; PykS has the promoter region proximal to the GAL4 region which is proximal 
to the short pBR322 sequence; while Pyk6 has a promoter region distal from the GAL4 region and proximal 
to the long pBR322 sequence. 

25 These plasmids described above were transformed into S. carlsbergensis strain 2150-2-3 (available 
from Lee Hartwell, University of Washington) under conventional conditions (Hinnen et aj., Proc. Natl. Acad. 
Sci. USA (1978) 75:1929-1933). Cultures of 50-1 50ml were grown to mid or late log phase in rich media 
(YEP) under neutral conditions (3% glycerol, 2% lactic acid), and then inducing conditions ( + 2% 
galactose), or repressing conditions ( + 2% glucose) for the final 1-2 generations. After lysis with glass 

30 beads and clarification of the supernatants by centrifugation, HBsAg expression was determined as 
described above. The results for the 12 plasmids are set forth in the following Table 1 . 
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55 Construction of pPGAP . 

A yeast expression vector was prepared called pPGAP having a poly restriction site linker between the 
GAPDH terminator and short promoter region. Plasmid plot5pGAP128 was digested with Bam HI and Tagl to 
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yield an approximately 390bp Bam HI-Taql fragment having the -400 to -26bp of the GAPDH promoter. The 
Bam HI-Taql fragment was ligated to a synthetic fragment having the following sequence: 

CGA 2 TA 3 (CA) 3 TA3CA 3 CACCATG 3 A 2 T 2 CGT 2 AG 2 
T 2 AT 3 ( GT ) gATgGTgGTGGTACjTjAjGCAjTCjAGCT 

to provide a BamHI-Sajl fragment, which was digested with Bam HI and Sail and used to replace the Bam HI- 
Sall fragment of Bam HI-Sall digested pBR322 treated with alkaline phosphatase. After ligation, the plasmid 
pGAPNRS was obtained which was digested with BamHI and Sail to provide a 400bp Bam HI-Sall fragment 
which was gel isolated. This fragment was ligated to an about 900bp Sall -Bam HI fragment containing the 
GAPDH terminator region and a short segment of 3' coding region and the resulting 1.4kb BamHI-BamHI 
fragment digested with Bam HI. The Sall -Bam HI GAPDH terminator fragment was obtained by Sail and 
Bam HI digestion of pGAP2, a plasmid prepared by insertion of an about 3.3kb Bam HI fragment containing 
the GAPDH gene GAP49 (Holland and Holland, supra ) into the BamHI site of pBR322. Plasmids pGAP2 and 
pGAP1 were obtained as follows: A yeast gene library was prepared by inserting fragments obtained after 
partial digestion of total yeast DNA with restriction endonuclease Sau3A in lambda-phage Charon 28 
(Blattner et aj., Science (1977) 196:161-169). The phage library was screened with DNA complementary to 
the yeast GAPDH mRNA and the yeast GAPDH gene from one of these clones was subcloned as either an 
about 3.3kb Bam HI fragment in the Bam HI site of pBR322 (pGAP-2) or as an about 2.1 kb Hindlll fragment 
in the Hindlll site of pBR322 (pGAP-1). 

pBR322 was digested with EcoRI and Sail, the termini blunt-ended and ligated to Bam HI linkers, 
followed by Bam HI digestion and the Bam HI -Bam HI 3.8kb fragment gel isolated, recircularized by self- 
ligation, cloned and designated pBRAR1-Sal. The 1.4kb Bam HI- Bam HI fragment was inserted into the 
BamHI-digested, alkaline phosphatase treated pBRARl-Sal vector to provide the plasmid pPGAP of about 
5.3kb with the orientation in the opposite direction of the amp f . 

The plasmid phSOD was prepared as follows: 

Molecular cloning of hSOD cDNA. 

Total RNA was prepared from an adult human liver by the guanidinium thiocyanate/lithium chloride 
method (Cathala et ah, DNA (1983) 2:329-335). polyA RNA was used to synthesize double-stranded cDNA 
(Maniatis et a|., Molecular Cloning , 213-242, Cold Spring Harbor, 1982) and this was passed over a 
SepharoseG CL4B column to enrich for cDNAs of greater than 350bp (Fiddes and Goodman. Nature (1979) 
282:351-356). The cDNA was inserted at the Pstl site of plot4, a pBR322 derivative having the following 
sequence replacing the Pstl-EcoRI site. 

Pst l Hinfl Alul 

1 GGTGAATCCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTC 
ACGTCCACTTAGGCATTAGTACCAGTATCGACAAAGGACACACTTTAACAATAGGCGAG 

H£hl 

Hindi I Alul 

60 ACAATTCCACACATTATACGAGCCGATGATTAATTGTCAACAGCTCATTTCAGAATATTT 
TGTTAAGGTGTGTAATATGCTCGGCTACTAATTAACAGTTGTCGAGTAAAGTCTTATAAA 

EcoRI 

120 GCCAGAACCGTTATGATGCGG 

CGGTCTTGGCAATACTACGCCTTAA 



The cDNA insertion employed the oligo-dGtdC tailing method (Maniatis et aj., supra ). E. coli strain D1210 
was transformed with this mixture and transformants selected on L-agar containing {bug/ml tetracycline 
(Kushner, S.R. (1978) In: Genetic Engineering , eds. Boyer, H.B. and Nicosia, S. t (Elsevier/North Holland, 
Amsterdam) p. 17). Plasmid DNA constituting a liver cDNA library was prepared (Maniatis et al., Molecular 
Cloning, pp. 86-94, Cold Spring Harbor 1982) directly from approximately 62,000 recombinant colonies 
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plated at a density of approximately 3.000 colonies per 9cm diameter Petri dish. 
Isolation of r-hSOD clones . 

s Strain D1210 was retransformed with the liver cDNA library and about 40,000 clones were grown on 
nine 14cm diameter Petri dishes. After transfer of the colonies to nitrocellulose paper and chloramphenicol 
amplification of plasmid DNA, the cells were lysed and the filters prepared for hybridization (Ish-Horowicz 
and Burke, Nucleic Acids Research (1981) 9:2989-2998). Oligonucleotide probes were employed for 
screening by hybridization, with the probes consisting of enzymatically-radiolabeled, chemically-synthesized 

w DNA molecules complementary to the mRNA encoding amino acid residues 19 to 24 of the protein 
(Jabusch et al., supra .; Barra et a}., supra .); the mixture had the following sequences: 



3 1 TTA AAA CTT GTT TTT CT 5 1 
G G C C C 



where all of the indicated possibilities for encoding the peptide sequence were prepared (32-fold degen- 
erate). 

20 The probes were labeled with to a specific activity of l-SxIO^cpm/ug and Millipore (0.45um) filtered 
before use. Filters were prehybridized for 6hrs at 30 *C in 4x SSC, 2x Denhardts's solution, 40m M sodium 
phosphate, pH 7.5, 300ug/ml sonicated salmon testes DNA. Hybridization was for 20hrs at 30 • C in the 
same solution containing 2x10*cpm/ml hSOD DNA probe (residues 19-24). Fitters were washed in 4x SSC, 
once for 15m in at r.t. and twice for 15m in at 30 'C, blotted dry and autoradiographed with an intensifying 

26 screen for 24hrs at -70 • C. 

Areas on the master plates that corresponded to duplicate positive signals were picked into L-broth and 
plasmid DNA prepared by the miniscreen procedure (Maniatis et aj., Molecular Cloning , 178, 368-369, Cold 
Spring Harbor 1982). This DNA was cut with Pstl and subjected to Southern blot analysis (Southern, J. Mol. 
Biol . (1975) 98:503-517) hybridizing initially with the previous labeled probes (amino acid residues 19-24) 

30 and then with additional radiolabeled probes derived from amino acid residues 109-114 and having the 
following sequences (all possible variations, 72-fold degenerate) present as a mixture: 



3' CTA GTA ACA TAA TAA CC 5' 
G G G G G 

T T 

One plasmid pool (pSOD1) contained a cDNA insert of 520bp that hybridized with both probes and after 
40 colony purification, plasmid DNA was prepared from this clone and sequenced by the method of Maxam 
and Gilbert (Proc. Natl. Acad. Sci. USA (1977) 74:560-564). The hSOD cDNA clone pSOD1 constitutes the 
coding region for amino acids 10-153 of hSOD, a single translation^ stop codon and a 3' untranslated 
region. Therefore, in the expression vector construct, the base sequence of the region encoding amino 
acids 1-9 is derived from the published amino acid sequence of hSOD (Jabusch et al., supra ; Barra et al., 
45 supra ) and synthesized chemically as a part of the variable linker segment (see discussion relating to"Rg 
3). 

Construction of plots derivatives containing r-hSOD . 

so The synthetic DNA molecules F(26). C(16), B(31), D(11), E(13) and 4(24) shown in Fig. 3, were 
synthesized by the phosphoramidite method. 

The single strand 4(24) was prepared by using all four bases, at each site where X is indicated. 
Furthermore, silica was withdrawn from the synthesis of the 24mer, such that single-stranded 21mers, 
22mers. and 23mers are obtained in addition to the 24mers. After removal from the silica support, the four 
55 mixtures are combined in appropriate proportions to provide for equimolar amounts of each of the possible 
single strands. This mixture was treated as a single product in the subsequent steps. 

Molecules F(26), C(16), B(31) and D(11) were mixed together in equimolar amounts and 10ug 
phosphorylated using T4 polynucleotide kinase. After phenol-ether extraction, the additional non- 
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phosphorylated synthetic DNA molecules 4(24) and E(13) were added, such that all fragments were 
equimolar. The equimolar mixture contained 13ug of DNA in 133ul of 0.3x kinase buffer. 

After annealing by cooling at a uniform rate from 70 *C to 20 *C over 60min, the single strands were 
ligated together with T4 ligase in 200ul ligation mix at 14* C for 4hrs, phenol-chloroform extracted, ethanol 

5 precipitated and the S'-ends of 4(24) and E(13) phosphorylated using T4 polynucleotide kinase (Maniatis et 
al., supra ). Preparative polyacrylamide gel electrophoresis was used to isolate the completely ligated 53bp 
material having 5'- and 3'-overhangs. 

The above purified fragment mixture was then ligated to the 460bp Tagl-Pstl segment of the hSOD 
cDNA as shown in Fig. 3. This segment was itself constructed by isolating the 454bp Tagl-Alul hSOD 

ro fragment, making it flush-ended using Klenow and inserting it into plots between its Eco RI and Sail sites 
which had been similarly made flush-ended. After preparation of plasmid DNA from this recombinant, the 
460bp Tagl-Pstl hSOD fragment was isolated by preparative polyacrylamide gel electrophoresis. After 
extraction and precipitation, the 515bp fragment resulting from the joining of the synthetic fragment to the 
460bp Tagl-Pstl hSOD fragment was blunt-ended (525-528bp) and then digested with Sail and the resulting 

75 519-522bp hSOD fragment isolated by polyacrylamide gel electrophoresis. This fragment was then inserted 
into plots which had been digested with Pvull and Sail and then treated with alkaline phosphatase. The 
resulting plasmids were used to transform strain D1210. Recombinants obtained after transformation of 
strain D1210 were selected on L-agar containing I00ug/ml ampicillin to give a set of clones, which were 
screened for an Ncol site. One was selected and designated phSOD. 

20 

Construction of a yeast vector for SOD expression . 

The plasmid phSOD was digested with Ncol and Sail and a 550bp fragment obtained, which included 1 
nucleotide untranslated at the 5'-terminus and the entire coding region for hSOD. pPGAP was digested with 
25 Ncol and Sail followed by treatment with alkaline phosphatase and the Sall-Ncol fragment substituted for 
the Ncol-Sall fragment in pPGAP to provide pPGAPSOD. Bam HI digestion of pPGAPSOD resulted in a 2kb 
fragment which was gel isolated and inserted into the Bam HI site of pC1/1 and pCl/1 GAL4/370. These 
plasmids were transformed into yeast strain 2150-2-3 as described previously, with the results of expression 
set forth in the following Table 2. 

30 

Table 2 



Expression of Human SOD in Yeast Strain 2150 


Plasmid 


Carbon Source 


SOD 2 ug/mg protein 


pC1/1 




0 


pCl/1 GAPSOD 


g.L 


148 


PC1/1GALGAPSOD 


g. l 


0.4 




gal 


68 



1 All cultures grown in Minus Leucine media with 2% lactic acid, 3% glycerol 
with or without 2% galactose to late log or early stationary phase. 

2 Determined by RIA. 



hSOD levels were measured using a standard radioimmunoassay with iodinated authentic hSOD as 
standard. Constitutive synthesis from the GAP promoter leads to very high levels of hSOD production, of 
the order of 10-30% of the total cell protein. The induction with galactose works almost as well, yielding 
about 7% of the cell protein as hSOD. 

Construction of pYASM 

This yeast expression plasmid contains the hSOD gene fused to the amino terminus of the human 
proinsulin gene, with a methionine codon at the junction between both genes. The fusion gene is under 
control of the hybrid inducible ADH2-GAP (yeast alcohol dehydrogenase 2) promoter and the GAP 
terminator. An about 3kbp Bam HI expression cassette was constructed. 

The ADH2 portion of the promoter was constructed by cutting a plasmid containing the wild type ADH2 
gene (plasmid pADR2, see Beier and Young, Nature (1982) 300:724-728), a pBR322 derivative, which 
contains the yeast centromere gene from chromosome 3 (CEN3) (U.S. Patent No. 4,464,472), the Trg1 
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gene and the yeast ADHH gene with the restriction enzyme EcoR5. which cuts at a position +66 relative to 
the ATG start codon, as well as in two other sites in pADR2, outside of the ADH2 region in the CEN 3 and 
Trp 1 gene. The resulting mixture of a vector fragment and two smaller fragments was resected with Bal31 
exonuclease to remove about 300bp. Synthetic Xhol linkers were ligated onto the Bal31 treated DNA. The 

5 resulting DNA linker vector fragment was separated from the linkers by column chromatography, cut with 
the restriction enzyme Xho l, religated and used to transform E. coli to ampicillin resistance. The positions of 
the Xhol linker additions were determined by DNA sequencing. One plasmid which contained an Xhol linker 
located within the 5' non-transcribed region of the ADH2 gene (position -232 from ATG) was cut with the 
restriction enzyme Xhol. treated with nuclease S1, and subsequently treated with the restriction enzyme 

io EcoRI to create a linear vector molecule having one blunt end at the site of the Xhol linker and an EcoRI 
end at the pBR322 proximal terminus. 

The GAP portion of the promoter was constructed by cutting plasmid pPGAP (supra) with the enzymes 
Bam HI and EcoRI, followed by the isolation of the 0.4Kbp DNA fragment. The purified fragment was cut 
with the enzyme Alul to create a blunt end near the Bam HI site. 

75 Plasmid pJS0l4 was constructed by the ligation of the Alul-EcoRI GAP promoter fragment to the ADH2 
fragment present on the linear vector described above. 

Plasmid pJS104 was digested with Bam HI (which cuts upstream of the ADH2 region) and with Nco l 
(which cuts downstream of the GAP region). The about 1.3Kbp fragment containing the ADH2-GAP 
promoter was gel purified and ligated to an about 1 .7Kbp fragment containing the hSOD-proinsulin fusion 

20 DNA sequences and GAP terminator present in pYSM. This 3Kbp expression cassette was cloned into 
Bam HI digested and phosphatase treated pCI/1 to yield pYASH. 

Plasmid pYSM was constructed as follows. Three fragments were employed which involve a 454bp 
Nco l -Sau 3A isolated from phSOD, where the fragment includes the entire coding sequence for human 
superoxide dismutase (hSOD) with the exception of the last three 3'- codons; a 51 bp Sau3A-Hindlll 

25 synthetic adapter, which codes for the last three codons of hSOD, methionine, and the first 14 codons of 
proinsulin; and a 231 bp Hindlll-Sall fragment, isolated from plNS5 (a pBR322-derived vector which contains 
a proinsulin coding sequence chemically synthesized according to the amino acid sequence reported by 
Bell et aJ., Nature (1979) 282:525-527), which encodes proinsulin excepting the first 14 amino acids. These 
fragments were ligated together and introduced into the plasmid pPGAP, which had been previously 

30 digested with Ncol and Sail and alkaline phosphatase treated. The resulting plasmid pSIt was digested with 
Bam HI to provide an expression cassette which was cloned into plasmid pCl/1 to yield pYSM. 

' Cloning of alpha-1 -antitrypsin . 

35 A cDNA library was made from 10ug of polyA + RNA isolated from a part of a human liver. This library 
was prepared by oligo-dT priming of the first cDNA strand and self-priming of the second cDNA strand. The 
ds cDNA was size fractionated on a Sepharose® CL4B column and those molecules greater than 300bp 
isolated. This fraction was treated with nuclease S1 and tailed with dCTP, using terminal transferase. The 
tailed cDNA was annealed to pBR322 which had been digested with Pstl and tailed with dGTP. Transforma- 

40 tion of E. coli HB101 yielded 60,000 colonies, where greater than 90% of the clones were recombinant. 

Two synthetic oligonucleotide probes were used to isolate the alpha-1 -antitrypsin (ai -AT) cDNA, the 
first probe corresponding to amino acid residues 344-350 near the C-terminus of the protein was used to 
probe 5,000 colonies and the second probe, corresponding to amino acid residues -23 to -17 (+1 being the 
first nucleotide of the first codon of the mature an -AT) of the signal peptide, was used to probe 25,000 

45 colonies. The probe sequences were taken from the partial nucleotide sequence described by Kurachi et 
a}., Proc. Natl. Acad. Sci. USA (1981) 78:6826; Leicht et al., Nature (1982) 297:655). Approximately 3% of 
the colonies hybridized to the C-terminal probe and four hybridized to the N-terminal probe. The four N- 
terminal clones and 12 C-terminal clones were isolated and subjected to restriction analysis. From these, 
three overlapping clones which cover the entire cDNA were subjected to further study and were used to 

so construct the full length cDNA clone. 
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The entire sequence of a composite full length cDNA derived from the three plasmids is as follows: 



w 



is 



20 



25 



30 



35 



40 



45 



6 
u 

12 

6 
18 

6 
2k 



10 
30 

12 
36 

1U 
12 

16 
48 

Id 
5*i 



20 
60 



22 
66 



24 
72 

26 
73 



26 
84 



i 



3" 
102 



38 



'Met Pro Ser Sep 
CGGCCGGCGGAGGCTAATCCACA ATC CCC TCT TCT 



-20 



-10 



-1 



BamHl 



Pro Thr Phe Asn Lys lie Thr Pro Asn Leu Ala Clu Phe Ala Phe Ser Leu Tyr Arg Gin 
CCA ACC TTC AAC AAG ATC ACC CCC AAC CTC CCT GAG TTC GCC TTC AGC CTA TAC CGC CAG 



Phe Ala Met Leu Ser Leu Gly Thr Lys Ala Asp Thr His Asp Glu lie Leu Glu Gly Leu 
TTT CCA ATC CTC TCC CTC CGC ACC AAG OCT GAG ACT CAC GAT GAA ATC CTG GAG CGC CTG 

In He His Glu Gly Phe Gin Glu Leu Leu 
AG ATC CAT GAA CGC TTC CAG GAA CTC CTC 



Arg(a ,c) 
His Thr ! 
CAT ACC < 

Ser Clu Gl 
ACC CAC 



gg£ 



Val Glu Lys Gly 
GTC GAG AAG OCT 

Val Phe Ala Leu 



Lys Asp 
AAC CAC 



Met 
ATC 



Lys Arg Leu Gly 
AAG CCT TTA CGC 



Leu Met Lys Tyr Leu CI 
CTG ATG AAA TAC CT 



Arg 
A OA 



Arg 
ACC 

Val(a,c) 



Ser He Leu Gl 
ACC ATC CTC CC 



Asp Clu Lys 
CAC CAC AAA 



Gly 

CCC 



ACT 



Pro Pro Glu Val I 
CCC CCC CAC CTC , 
— — Aval 



Leu 
CTC 


Leu 
CTG 


Leu 
CTG 


Ala 
CCA 


Gly 
GGC 


Asp 
CAT 


Aid 
OCT 


Ala 

GCC 


Gin 
CAG 


Lys 
AAG 


lie 

ATC 


Thr 
ACC 


Pro 

CCC 


Asn 
AAC 


Leu 
CTG 

w a w 


Asn 
AAC 


Ser 
AGC 


Thr 
ACC 

n ^0 w 


Asn 
AAT 


He 
ATC 


Leu 
CTG 


Gly 
CCC 


Thr 
ACC 


Lys 
AAG 


Ala 

GCT 


Glu 
GAG 


lie 
ATT 


Pro 

CCG 


Glu 
GAG 


Ala 

GCT 


Pro 

CCA 


Asp 

CAC 


Ser 
AGC 


Gin 
CAC 


Leu 
CTC 


Leu 
CTA 


Val 

GTC 


Asp 

GAT 


Lys 
AAG 


Phe 
TTT 


Asn 
AAC 


Phe 
TTC 


Gly 
CGC 


Asp 
CAC 


Thr 
ACC 


Gin 
CAA 


Cly 
GGC 


AAA 


lie 
ATT 


Val 
GTC 


Asn 

AAT 


Tyr 
TAC 


He 
ATC 


Phe 
TTC 


Phe 
TTT 


Glu 
GAG 


Asp 
CAC 


Phe 

TTC 


His 
CAC 


Val 
CTG 


Met 
ATG 


Phe 
TTT 


Asn 

AAC 


lie 

ATC 


Gin 
CAG 


Gly 
CGC 


Asn 
AAT 


Ala 

GCC 


Asn(c) 
Thr Ala 
ACC GCC 


Glu 
CAA 


Leu 
CTC 


Thr 
ACC 


His 
CAC 


Asp 
GAT 


Leu 
TTA 


His 
CAT 


Leu 
TTA 


Pro 

CCC 


Lys 
AAA 


Leu 
CTG 


Gly 
GCC 


He 
ATC 


Thr 
ACT 


Lys 
AAG 


Pro 
CCC 


Leu 

CTC 


Lys 
AAG 


Leu 
CTC 


Ser 
TCC 


Glu 
GAA 


Ala 

GCT 


Ala 

GCT 


Cly 
CCC 


Ala 

GCC 


Phe 
TTC 


Asn 
AAC 


Lys 
AAA 


Pro 
CCC 


Phe 
TTT 


Civ 
CCA 


Lys 

AAA 


Val 
CTC 


Val 
CTC 


Asn 
AAT 



Leu Cys Cys Leu Val Pro Val Ser Leu Ala 



Asp Gly(c) 
\r Gly Asn Gly 
:C GCC AAT CGC 



Leu Phe Leu 

CTG TTC CTC 



X 



AAC AAA CAG ATC AAC CAT TAC 



1g£ 



AAA TCC GAG ACA CCC TTT GAA GTC 



Ala(b) 



BstEII 



Lys 
AAC 



Val Pro Met 
CTG CCT ATG 



Cys 
TCT 



Lys Lys Leu Ser Ser Trp Val Leu 
AAG AAC CTG TCC AGC TCC CTG CTG 



He Phe 
ATC TTC 


Phe 
TTC 


Leu 
CTG 


Pro 
CCT 


Asp 
GAT 


He 
ATC 


He Thr 
ATC ACC 
EcoRV 


Lys 
AAC 


Phe 
TTC 


Leu 
CTG 


Leu Ser 
CTG TCC 


He 
ATT 


Thr 
ACT 


Gly 
CCA 


Thr 
ACC 


Val Phe 
GTC TTC 


Ser 
AGC 


Asn 
AAT 


Cly 
CCG 


Ala 

GCT 


Lys 
AAG 


Ala 

CCC 


Val 
CTG 


His 
CAT 


Lys 
AAG 


Ala 

GCT 


Met 
ATG 


Phe 
TTT 


Leu 
TTA 


Glu 
GAG 


Ala 

GCC 


He 
ATA 


Val 
CTC 


Phe 
TTC 


Leu 
TTA 


Met 
ATC 


He 
ATT 


Clu 
CAA 



ACC TAT GAT CTG AAG 



Ala Asp Leu Ser Gly 
GCT CAC CTC TCC GC6 



CCC 



Met Ser 
ATC TCT 



]Ile 
ATC 



50 



AAT CCC ACC CAA AAA TAC 

CCC TCC CTT TTT ATC ACCT 

Sail 



l.yjl CCCTCCCCTCCATCCCTCCCCCCCTCCCTCCATCACATTAAACAACCCTTCACCTCCAAAAAAAAAAAAAAAAAAAAAA 



55 
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LEGEND 



Nucleotide and predicted amino acid sequences of <*i-AT cDNA. The reactive center met-ser at 
positions 358-359 is boxed. Subscripts to amino acids in parentheses identify differences between the 
5 subject protein sequence and those derived from (a) protein sequencing (Carrell et al., 1982). (b) the cDNA 
of Woo et al., [see Carrell et al., 1982]), and (c) the cDNA of Bollen et al., 1983. The synthetic DNA 
molecules used in the construction of the Bam HI to Sail fragment encoding the mature protein are shown as 
are the cDNA restriction sites used in this construction. 

The above sequence was determined using the dideoxy sequencing method of Sanger et al., Proc. 
w Natl. Acad. ScL USA (1977) 74:5463, in the M13 vectors of Messing et aj. t Nucleic Acids Res. (1981) 9:309. 
The differences at the nucleotide and amino acid level from the published cDNA sequences are shown. 

Construction of the full length clone for expression of yeast began with three fragments isolated from 
cDNA clones: 1) a 630bp Bam HI-BstEII fragment; 2) a 450bp BstEII-Aval fragment; and 3) an 85bp Ava l- 
Hinfl fragment. A synthetic adapter was employed having the following sequence: 

75 

A 2 TC 3 AC 3 A 5 TAG 

G 3 TG 3 T 5 ATCAGCT 

20 



Approximately two pmoles of fragments 1 and 2 were ligated together and after removal of the ligase, 
digested with Bam HI and Aval. Fragment 3 and the synthetic adapter were ligated and digested with Ava l 
and Sail and the two resulting fragment mixtures were ligated followed by digestion with Bam HI and Sai l. 
25 Fragments migrating upon electrophoresis in the region of about 1 000-1 400bp were isolated and cloned by 
substitution into Bam HI and Sail digested and alkaline phosphatase treated pBR322. The resulting plasmid 
is referred to as pATi. 

Plasmid pPGAP was digested with Ncol, followed by blunt-ending, followed by Sail digestion and 
treatment with alkaline phosphatase. The Nco l-Sall fragment was substituted with an approximately I250bp 

30 blunt-ended (BamHI)-SaJI fragment obtained from plasmid pATi, by Bam HI digestion, blunt ending, and Sail 
digestion. This was inserted into the pPGAP vector to produce the plasmid pGAPATi, a 6.6kb plasmid, 
which was digested with Nco l and Bam HI and a 2.3kb Ncol- Bam HI fragment obtained having the en -AT 
-gene and the GAPDH terminator and approximately 400bp BamHI-Ncol fragment obtained having the 
GAPDH promoter. These fragments were ligated together and inserted into the Bam HI site of pC1/1. The 

35 plasmids pCl/1GAPATi8 and pC1/1GAPATi9 were obtained with the orientation of expression clockwise in 
the former and counterclockwise in the latter, with amp r being in the counterclockwise direction. These 
plasmids were transformed in S. cerevisiae AB103 {A.T.C.C. No. 20658, deposited January 5, 1983) by 
standard methods, selecting for leucine prototrophy and grown as described above. Yeast extracts were 
prepared by lysis with glass beads and the en -AT activity determined by inhibition of human leukocyte 

40 elastase. 

Assays contained in 1ml:0. 1-0.2 human leukocyte elastase (HLE); 0.1 mM MeO-Suc-Ala-Ala-Pro-Val-g- 
nitro-anilide (Beatty et a}., J. Biol. Chem. (1980) 255:3931); 50mM Tris. pH 8, 0.5M NaCI, and the indicated 
amounts of yeast extract or human ai -AT. Assays were initiated by the addition of elastase, incubated at 
28 # C for 15min. terminated by the addition of 100ul of 8N acetic acid and the absorbance at 410nm 
45 determined. Typical results are shown in the following Table 3. 
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The above data demonstrate that plasm ids having the orientation of the expression cassette in the 
55 counterclockwise orientation, the promoter proximal to the long sequence of pBR322, make 10-20 times 
more on -AT than the same cassette in the other orientation. 
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Yeast strain AB110 . 

Yeast strain 2150-2-3 was crossed with a yeast strain AB103 transformant containing pC1/1GAPATi9. 
The diploids were sporulated and the tetrads dissected. Strains were maintained on leucine selective plates 
5 in order to ensure maintenance of the plasmid, since the parents are auxotrophs. A series of colonies were 
screened for their genotype with respect to a number of markers. The most vigorous strains were selected 
and cultures grown on leucine selective media. The best strain was designated AB110 (pC1/1GAPATi9), 
gave 6-7.5% of the total cell protein as en -AT as shown in the above Table 3. The strain AB110 has the 
following genotype: Mate, ura3-52, leu2-04 or both leu2-3 and Ieu2-112 t pep4-3, his4-580 (cir • ). 

10 

Phosphate induction . 

Plasmid pPGT80 was digested with BamHI, the ends blunt-ended, followed by digestion with Xbal and 
the 500bp fragment containing the GAPDH promoter and 5'-end of the sAg gene isolated. 

75 The PH05 gene was isolated from a yeast genomic library employing an oligonucleotide probe 5'- 
GGCACTCACACGTGGGACTAG-3* derived from the published partial sequence (Meyhack et al., The 
EMBO Journal (1982) 1375-680). a subfragment of this clone containing 550bp of the S'-untranslated 
region and approximately 80bp of coding sequence was subcloned as a Bam HI-Sall substitution in pBR322 
to provide pPH05. This fragment has the sequence 5'-ATGTTTAAA-3\ encoding the first three amino acids. 

20 the second and third codons specifying an Ahalll site. The plasmid pHBS6 was digested with Ncol, blunt- 
ended, followed by digestion with Bam HI and treatment with alkaline phosphatase. The PH05 promoter 
region was obtained by digesting the pPHOS plasmid with Ahalll, resecting the resulting fragment with 
Bal31 for a short time, followed by digestion with Bam HI and isolation of a 50O550bp Bam HI blunt-ended 
fragment. This fragment was employed for substitution of the Ncol-BamHI fragment from pHBS6 and was 

25 screened for regeneration of the Nco l restriction site to provide plasmid pHBS6PH05/1 . 

Plasmid pHBS6PH05/1 was digested with BstEII which cleaves at position -175 in the PH05 promoter. 
This molecule was blunt-ended, digested with Sail and the 650bp fragment having the 5'-portion of the 
promoter domain, containing 275bp of pBR322 and 375bp of the PH05 promoter region isolated. This 
fragment was ligated with the blunt-ended (Bam HI)-Xbal fragment obtained from digestion of pPGT80 with 

30 Bam HI, blunt ending, followed by Xbal digestion. After digesting the ligated fragment with Sail and Xbal, the 
resulting fragment was then substituted into pPGT16-3 which had been digested with Sail and Xba l and 
treated with alkaline phosphatase. The resulting plasmid pPHO5PGT80 had a cassette comprising the 
PH05 regulatory region, the GAPDH promoter, the sAg gene and the ADH terminator. This cassette was 
excised from the plasmid by Bam HI digestion, whereby a 1.8kb Bam HI -Bam HI fragment was gel isolated 

35 and ligated into the Bam HI site of Bam HI digested and alkaline phosphatase treated pCl/1 to provide 
plasmids PH05GAP1 and PH05GAP2 where the PH05 was distal and proximal to the long pBR322 
sequence, respectively. 

The two plasmids were transformed into yeast strain 2150-2-3 as described above and grown in rich 
media as described above for 8 to 10 generations in either high (7mM) or low (0.2mM) phosphate. Samples 
40 were harvested in late log phase and HBsAg determined as described previously. The results are shown 
below in Table 4. 



Table 4 



45 


Regulation of HBsAg Production in Yeast using a Hybrid PH05/GAPDH Promoter. 




Construction 


High Phosphate (7mM) 


Low Phosphate (0.2mM) 


Induction low / high 






(sAg ug/mg protein) 




50 


PH05GAP-1 


0.08 


0.95 


12.0 




PH05GAP-2 


0.27 


0.40 


1.5 



From the above results, it is evident that effective regulation with phosphate is obtained, with one 
orientation being superior to the other. 

It is evident from the above results that highly efficient expression can be obtained, either constitutive or 
regulated, by providing for truncated promoter regions of yeast glycolytic enzyme gene promoters, 
employing the 3' domain proximal to the coding region of the gene in conjunction with a 5 f -portion or 
second domain of the promoter region of a yeast gene subject to inducible regulation by a nutrient, e.g.. 
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carbon source or phosphate, temperature, or other externally controllable source or condition. Alternatively, 
the second domain may be replaced by prokaryotic sequences of at least about 1kb or greater, which 
provide for constitutive enhancement in the absence of the second domain of the promoter region. Thus, a 
wide variety of genes exogenous to yeast may be expressed in high yield in high percentages of the total 
5 protein of the yeast host. 

Although the foregoing invention has been described in some detail by way of illustration and example 
for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be 
practiced within the scope of the appended claims. 

to Claims 

1. A method for obtaining expression of a foreign structural gene in a yeast host employing hybrid 
constructions involving transcriptional and translations regulatory signals recognized by said host 
joined to a foreign gene for transcriptional and translational regulation of expression in said host, 

75 wherein said construction is introduced into said host and said foreign gene is expressed to produce 
the encoded polypeptide, comprising: 

employing for transcriptional regulation, a DNA sequence having first and second domains, 
said first domain being proximal to said structural gene, being of less than about 600bp and having a 
sequence which includes the RNA polymerase binding site and transcription initiation site of a yeast 
20 glyceraldehyde-3-phosphate dehydrogenase gene; 

said second domain being proximal to said first domain and distal to said structural gene, providing for 
enhanced efficiency of transcription and being the regulatory region of a regulated yeast gene other 
than the natural gene of said first domain; and 
said regulatory region being the PH05 regulatory region. 

25 

2. A method according to Claim 1, wherein said polypeptide is Hepatitis B surface antigen, superoxide 
dismutase, or on -antitrypsin. 

3. A method according to claim 1, wherein said regulatory region is the ADHII regulatory region. 

30 

4. A method for obtaining expression of a foreign polypeptide in a yeast host employing a hybrid 
construction involving transcriptional and translational regulatory signals recognized by said host joined 
to a foreign gene for transcriptional and translational regulation of expression in said host, wherein said 
construction is introduced into said host and said foreign gene is expressed to produce the encoded 

35 polypeptide, comprising employing for transcriptional regulation, a DNA sequence having first and 
second domains, 

said first domain being proximal to said structural gene, being of about 200-500bp, being derived from 
the S'-untranstated region of the yeast glyceraldehyde-3-phosphate dehydrogenase or pyruvate kinase 
gene and extending downstream to at least nucleotide -10; and 
40 said second domain being distal to said first domain and proximal to said structural gene, being a 
prokaryotic sequence of at least about lOOObp, wherein said yeast cells containing said construct are 
grown to high density; and 
isolating said polypeptide. 

45 5. A method for preparing a polypeptide in a yeast host, where the polypeptide is heterologous to the 
yeast and may be produced in tow percentage amounts of total protein, comprising: 
joining an open reading frame DNA sequence coding for said polypeptide with a second open reading 
frame DNA sequence coding for heterologous human superoxide dismutase to form a fusion polypep- 
tide; 

50 introducing the sequence coding for said fusion polypeptide under conditions for expression in said 
yeast which includes an inducible transcriptional initiation regulatory region that consists essentially of, 
reading in a 5' to 3* direction, a yeast glycolytic enzyme promoter region and adjacent alcohol 
dehydrogenase-2 control region, whereby said fusion polypeptide is expressed; 
isolating said fusion polypeptide to provide the polypeptide in high yield. 

55 
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Patentansprtlche 

1. Verfahren zum Erhalt eines fremden strukturellen Gens in einem Hefewirt unter Verwendung von 
hybridan Konstrukten. die transkriptionelle und translatorische, von dem Wirt erkannte Regulationssi- 

s gnale. welche mit einem Fremdgen zur transkriptionellen und translatorischen Regulation der Expres- 
sion in dem Wirt verbunden sind, einschlieBen, wobei das Konstrukt in den Wirt eingefUhrt und das 
Fremdgen zur Bildung des kodierten Polypeptids exprimiert wird, 
gekennzeichnet durch 

die Verwendung einer DNA-Sequenz mit ersten und zweiten DomSnen zur transkriptionellen Regulation, 
io wobei die erste DomSne sich proximal zu dem Strukturgen befindet, weniger ais etwa 600 bp hat und 
eine Sequenz aufweist. die die Bindungsstelle der RNA-Polymerase und die 
Initiationsstelle 

der Transkription eines Hefe-Glyceraidehyd-3-phosphat-dehydrogenasegens einschlieBt; und wobei sich 
die zweite Domane proximal zu der ersten DomSne sowie zu dem Strukturgen befindet, die verbesserte 
75 Effizienz der Transkription bewirkt und die Regulationsregion eines regulierten Hefegens, das ein 
anderes als das natUrliche Gen der ersten Domane ist und die Regulationsregion die PH05-Regula- 
tionsregion ist. 

2. Verfahren nach Anspruch 1 ( bei dem das Polypeptid das Hepatitis-B-OberflSchen-Antigen, Superoxid- 
20 dismutase oder alphai -Antitrypsin ist. 

3. Verfahren nach Anspruch 1, bei dem die Regulationsregion die ADHII-Regulationsregion ist. 

4. Verfahren zum Erhalt der Expression eines Fremdpolypeptids in einem Hefewirt unter Verwendung 
26 eines Hybridkonstrukts, das transkriptionelle und translatorische, von dem Wirt erkannte Regulationssi- 

gnale, welche mit einem Fremdgen zur transkriptionellen und translatorischen Expression in dem Wirt 
erkannte Regulationssignaie einschlieBt, wobei das Konstrukt in den Wirt eingefUhrt und das Fremdgen 
zur Bildung des kodierten Polypeptids exprimiert wird, und bei dem fOr die transkriptionelle Regulation 
eine DNA-Sequenz mit ersten und zweiten DomSnen verwendet wird, 
30 wobei die erste DomSne sich proximal zu dem Strukturgen befindet, weniger als etwa 200 - 500 bp hat 
und von der S'-nichttranslatierten Region des Hefe-Glyceraldehyd-3-Phosphat-Dehydrogenase- Oder 
Pryruvatkinase-Gens abgeleitet ist und sich stromabw&rts bis mindestens zu dem Nucleotid -10 
erstreckt, und 

die zweite Domane sich distal zu der ersten Domane und proximal zu dem Strukturgen befindet, 
35 welches eine prokaryotische Sequenz von mindestens etwa 1000 bp ist, wobei die das Konstrukt 
enthaltenden Zellen bis zu einer hohen Dichte gezUchtet werden, und 
das Polypeptid isoliert wird. 

5. In einem Verfahren zur Herstellung eines Polypeptids in einem Hefewirt, wobei das Polypeptidheterolog 
40 zu der Hefe ist und in niedrigen prozentualen Mengen des Gesamtproteins hergestellt wird, eine 

Verbesserung, die folgendes umfaBt: 

VerknUpfung einer DNA-Sequenz in offenem Leseraster, die fUr das Polypeptid kodiert, mit einer DNA- 
Sequenz eines zweiten offenen Leserasters, welche fUr heterologe menschliche Superoxiddismutase 
kodiert, unter Bildung eines Fusionspolypeptids; 
45 EinfUhrung der fUr das Fusionspolypeptid unter Bedingung der Expression kodierenden Sequenz in die 
Hefe, welche eine induzierbare transkriptionelle Initiations-Regulationsregion einschlieBt, welche im 
wesentlichen aus, gelesen in 5*- 3'-Richtung, eine Promoterregion fur eine glycolytisches Enzym aus 
Hefe und eine benachbarte Alkohol Dehydrogenase-2-Kontrollregion besteht, und 
Isolation des Fusionspolypeptids unter Bildung des Polypeptids in hoher Ausbeute. 

50 

Revendlcatlons 

1. Proc§d6 d'obtention de ('expression d'un g&ne structurel Stranger dans una levure hote employant des 
constructions hybrides faisant appel a des signaux r£gulateurs de transcription et de traduction 
55 reconnus par ledit hote soude* a un gfene Stranger pour la regulation de la transcription et de la 
traduction de ('expression dans ledit hote, dans lequel ladite construction est introduite dans ledit hote 
et ledit g§ne Stranger est exprimg pour produire le polypeptide code\ comprenant : 

I'emploi, pour la regulation de la transcription, d'une sequence d'ADN poss£dant un premier 
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domaine et un second domaine, 

ledit premier domaine etant proximal dudit g&ne structurel, pr§sentant moins d environ 600 po et 
possedant une sequence qui renferme un site de liaison d'ARN polymerase et un site d'initiation de la 
transcription d'un g6ne de levure glyc6raldehyde-3-phosphate d6shydrog6nase; 
5 ledit second domaine etant proximal dudit premier domaine et distal dudit gfene sfructurei. 

permettant d'ameiiorer I'efficacite de la transcription et formant la region regulatrice d'un gfene de 
levure reguie autre que ledit gfcne naturel dudit premier domaine; et 
ladite region regulatrice etant la region regulatrice PH05. 

w 2. Proc6d6 selon la revendication 1 , dans lequel ledit polypeptide est I'antigfcne de surface de I'hepatite 
B, la superoxyde dismutase ou Tai-antitrypsine. 

3. Proc6d6 selon la revendication 1 , dans lequel ladite region regulatrice est la region regulatrice ADHII. 

T5 4. Procede d'obtention de I'expression d'un polypeptide Stranger dans une levure hote employant une 
construction hyrbide faisant appel h des signaux rfgulateurs de transcription et de traduction reconnus 
par ledit h8te soud6 k un gfene Stranger pour la regulation de la transcription et de la traduction de 
(•expression dans ledit hdte. dans lequel ladite construction est introduce dans ledit h8te et ledit g$ne 
Stranger est exprimS pour produire le polypeptide code, comprenant I'emploi. pour la regulation de la 
20 transcription, d'une sequence d'ADN possedant un premier domaine et un second domaine, 

ledit premier domaine etant proximal dudit gfcne structurel, prSsentant environ 200-500 pb, etant 
derive de la region 5'-non traduite du gfene de levure glyceraldehyde-3-phosphate deshydrogenase ou 
pyruvate kinase et se prolongeant en aval au moins jusqu'au nucleotide -10; et 

ledit second domaine etant distal dudit premier domaine et proximal dudit gfene structurel, 6tant 
25 une sequence procaryote d'au moins environ 1000 pb, dans lequel lesdites cellules de levure contenant 
ledit produit de recombinaison sont cultiv6es jusqu'fc une haute density et 
Tisolement dudit polypeptide. 

5. Procede de preparation d'un polypeptide dans une levure h8te. dans laquelle le polypeptide est 
heterologue de la levure et peut etre produit en faible pourcentage de la proteine totale. comprenant : 

la soudure d'une sequence d'ADN & cadre de lecture ouvert codant pour ledit polypeptide avec 
une seconde sequence d'ADN h cadre de lecture ouvert codant pour la superoxyde dismutase 
humaine heterologue, pour former un polypeptide de fusion; 

Introduction de la sequence codant pour ledit polypeptide de fusion, dans des conditions 
d'expression dans ladite levure qui renferme une region regulatrice de I'initiation de la transcription 
pouvant §tre induite, qui consiste essentiellement & lire dans le sens 5' & 3' une region promoteur 
d'enzyme glycolytique de levure et une region adjacente de contrdle de I'alcool deshydrogenase-2. ce 
qui permet d'exprimer ledit polypeptide de fusion; 

I'isolement dudit polypeptide de fusion pour obtenir le polypeptide avec un rendement eieve. 
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